IN THE UNITED STATES PATENT AND TRADEMARK OFFICE

In re application of: SEIDAH et al.
Docket No.: 480848.9002*
Serial No.: Unassigned
Filed: Concurrently herewith
Int'l appln No.: PCT/CA99/01058
Int'l filing date: 04 Nov 1999
Title: MAMMALIAN SUBTILISIN/KEXIN ISOZYME SKI-1: A PROPROTEIN CONVERTASE WITH A UNIQUE CLEAVAGE SPECIFICITY



******************** ******************************** 



PRELIMINARY AMENDMENT 



Box PCT 

Assistant Commissioner for Patents 
Washington, D.C. 20231 

Dear Sir: 

In connection with the above-identified application filed herewith, please 
enter the following preliminary amendment: 
IN THE CLAIMS: 

Please amend Claims 3, 10, 11, 15, 16, 21, 25, 28-30, 35 and 36 as follows:

3. The proteic fragment of claim 2, wherein said part has a molecular weight
of about 14 kDa and forms a tight complex with the soluble fragment of SKI-1 [as
defined in claim 1].

10. An isolated nucleic acid encoding a proteic fragment as defined in [any
one of claims 4 to 6] claim 4.

11. A recombinant vector comprising the nucleic acid defined in [any one of
claims 7 to 10] claim 7.

15. A recombinant host cell comprising the recombinant vector defined in [any
one of claims 11 to 14] claim 11.

16. A method of producing a proteic fragment of SKI-1 enzyme, which 
comprises the steps of: 

culturing a recombinant host cell expressing a nucleic acid as defined in
[any one of claims 7 to 10] claim 7 in a cell growth and expression-supportive
culture medium; and recovering said proteic fragment of SKI-1
in the culture medium.

21. A method of inhibiting the activity of a subtilisin-kexin isoenzyme named
SKI-1, which comprises the step of contacting SKI-1 with the inhibitor [defined in any
one] of [claims 4 to 6, 8 and 10] claim 4 or an isolated nucleic acid encoding the
inhibitor.

25. A peptide as defined in [any one of claims 22 to 24] claim 22 which is 
labelled. 

28. The use of a peptide as defined in [any one of claims 22 to 27] claim 22 
for monitoring the activity of a subtilisin-kexin isoenzyme named SKI-1. 

29. The use as defined in [any one of claims 22 to 27] claim 22 for screening 
inhibitors of a subtilisin-kexin isoenzyme named SKI-1. 

30. The use as defined in [any one of claims 22 to 27] claim 22 for screening 
a subtilisin-kexin isoenzyme named SKI-1. 

35. The use as defined in claim 33 [or 34], wherein said inhibitor is defined in 
[any one of claims] claim 2[, 4 to 6, 8 and 10]. 



36. A composition comprising a SKI-1 fragment as defined in [any one of
claims] claim 1 [to 6, or a nucleic acid defined in any one of claims 7 to 10, or a
recombinant vector as defined in any one of claims 7 to 10, or a recombinant vector as
defined in any one of claims 11 to 14].

Please add the following claims: 

38. (New Claim) A composition comprising an SKI-1 fragment as defined in 
claim 2. 

39. (New Claim) A composition comprising a SKI-1 fragment as defined in 
claim 3. 

40. (New Claim) A composition comprising a SKI-1 fragment as defined in 
claim 4. 

41. (New Claim) A composition comprising a SKI-1 fragment as defined in 
claim 5. 

42. (New Claim) A composition comprising a SKI-1 fragment as defined in 
claim 6. 

43. (New Claim) A composition comprising a nucleic acid as defined in claim
7.

44. (New Claim) A composition comprising a nucleic acid as defined in claim
8.

45. (New Claim) A composition comprising a nucleic acid as defined in claim
9.

46. (New Claim) A composition comprising a nucleic acid as defined in claim
10.

47. (New Claim) A composition comprising a recombinant vector as defined 
in claim 7. 

48. (New Claim) A composition comprising a recombinant vector as defined 
in claim 8. 

49. (New Claim) A composition comprising a recombinant vector as defined 
in claim 9. 

50. (New Claim) A composition comprising a recombinant vector as defined 
in claim 10. 

51. (New Claim) A composition comprising a recombinant vector as defined
in claim 11.

52. (New Claim) A composition comprising a recombinant vector as defined 
in claim 12. 

53. (New Claim) A composition comprising a recombinant vector as defined 
in claim 13. 

54. (New Claim) A composition comprising a recombinant vector as defined 
in claim 14. 

REMARKS 

The above amendments are being made to eliminate multiple dependencies in
the claims of this application.

No fee is believed necessary to enter this amendment. However, if a fee is
necessary, please charge Deposit Account 17-0055.

Applicant respectfully requests that the preliminary amendment described herein 
be entered into the record prior to examination and consideration of the above-identified 
application. 



Date: May 1, 2001

QUARLES & BRADY
411 East Wisconsin Avenue
Milwaukee, WI 53202-4497
U.S.A.

(414) 277-5709

QUARLES & BRADY LLP
Attorney of Record

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



Applicants: Nabil G. Seidah, et al. 

Serial No. : 09/830,837 
Filed: April 30, 2001 

For: MAMMALIAN SUBTILISIN/KEXIN ISOZYME SKI-1: 

A PROPROTEIN CONVERTASE WITH A UNIQUE 
CLEAVAGE SPECIFICITY 

Group Art Unit: 
Examiner : 

Commissioner for Patents 
Washington, D.C. 20231 

PRELIMINARY AMENDMENT 



Dear Sir: 

In the matter of the above-identified case, Applicants wish
to make the following amendments.

In the Specification:

1. Please amend paragraph 2, page 2, to read as follows:

It was further discovered by Cheng, D. et al. (1999) J. Biol. Chem. 274:22805-22812
that an enzyme called S1P is capable of cleaving sterol-regulatory element-binding proteins
(SREBPs), which function to control lipid biosynthesis and uptake in animal cells. Upon
cleavage, SREBPs are released from cell membranes for translocation to the nucleus, where
they activate transcription of genes involved in the biosynthesis and uptake of cholesterol and
fatty acids. S1P and the present enzyme are the same. Therefore, for diseases involving
overexpression of these genes as well as any other disease involving SKI-1 activity, it is
contemplated that any inhibitor of SKI-1 would be useful in their treatment.

2. On page 27, please amend Paragraph 1, as follows: 

Genetic and biochemical evidence indicates that SKI-1/S1P is the protease that cleaves
sterol-regulatory element-binding proteins (SREBPs), which function to control lipid
biosynthesis and uptake in animal cells {Sakai, J. et al. (1998) Molecular Cell 2, 505-514;
Cheng, D. et al. (1999) J. Biol. Chem. 274, 22805-22812; Toure, A. et al. (1999) In: Peptides
for the New Millennium: Proceedings of the 16th American Peptide Symposium}. SKI-1 and
SREBPs play critical roles in the feedback pathways by which cholesterol suppresses
transcription of genes encoding HMG CoA reductase and other enzymes of cholesterol
biosynthesis as well as the low density lipoprotein (LDL) receptor. A SKI-1 inhibitor would
be of use under clinical conditions in which there is not sufficient down regulation of SREBP
dependent transcription by sterols. For example, in the Niemann-Pick group of diseases a high
sphingomyelin content of cells leads to an increase in proteolysis of SREBP-2 and a subsequent
increase in cholesterol biosynthesis {Scheek, S. et al. (1997) Proc. Natl. Acad. Sci. USA 94,
11179-11183; Spence, M.W., and Callahan, J.W. (1989) Sphingomyelin-cholesterol lipidoses:
The Niemann-Pick Group of Diseases. In The Metabolic Basis of Inherited Disease (Scriver,
C.R., Beaudet, A.L., Sly, W.S., and Valle, D., editors), McGraw-Hill Publ. Co., 6th edition,
chapter 66, 1655-1676; Sviridov, D. (1999) Histology & Histopathology 14 (1): 305-319}.
Perhaps of greater significance, nuclear SREBP-1c protein levels were significantly elevated
in mouse models for non-insulin dependent diabetes, ob/ob and aP2 SREBP-1c mice, which
was associated with elevated mRNA levels for known SREBP target genes involved in the
biosynthesis of fatty acids (Shimomura, I. et al. J. Biol. Chem. 1999; 274:30028-30032).

3. On page 50, please amend the last paragraph to read: 

Results of immunocytochemistry performed in mouse lacrimal glands provide
evidence for the presence of SKI-1 and APP in the same cell types, including intralobular
duct epithelial cells and some acinar cells (Fig. 26). The finding of SKI-1 in the lacrimal
gland suggests the possibility of developing a diagnostic assay analyzing tears; perhaps based
on two-dimensional polyacrylamide gel electrophoresis for disease diagnosis {Moley, M.P. et
al. (1997) Electrophoresis 18, 2811-2815; Glasson, M.J. et al. (1998) Electrophoresis 19,
852-855; Grus, F.H., and Augustin, A.J. (1999) Electrophoresis 20, 875-880; Iskeleli, G. et al.
(1999) CLAO Journal 25:101-104;

4. On page 72, please amend Example 1, 3., to read as follows:

Seidah, N.G., Mbikay, M., Marcinkiewicz, M., & Chretien, M. (1998) in Proteolytic
and Cellular Mechanisms in Prohormone and Proprotein Processing, ed. Hook, V.Y.H. (R.G.
Landes Company, Georgetown, TX), pp. 49-76.

5. On page 72, please amend Example 1, 4., to read as follows:

Ling, N., Burgus, R., & Guillemin, R. (1976) Proc. Natl. Acad. Sci. USA 73, 3942-3946.

6. On pages 74-75, please amend Example 2, 10., to read as follows:

Seidah, N.G., Mbikay, M., Marcinkiewicz, M. and Chretien, M., The mammalian 
precursor convertases: paralogs of the subtilisin/kexin family of calcium-dependent serine 
proteinases. In: Hook, V.Y.H. (Ed.), Proteolytic and Cellular Mechanisms in Prohormone 
and Proprotein Processing. R.G. Landes Company, Georgetown, TX, USA, 1998, pp. 49-76. 

7. On page 75, please amend Example 2, 13., to read as follows:

Hallenberger, S., Moulard, M., Sordel, M., Klenk, H.D., and Garten, W. - The role of
eukaryotic subtilisin-like endoproteases for the activation of human immunodeficiency virus
glycoproteins in natural host cells. - Journal of Virology 1997;71:1036-1045.

8. On page 77, please amend Example 3, 4th reference, to read as follows:

Ling, N., Burgus, R., and Guillemin, R. (1976) Proc. Natl. Acad. Sci. USA 73, 3942-3946.

9. On page 78, please amend Example 3, 21st reference, to read as follows:

Rittenhouse, J., and Marcus, F. (1984) Anal. Biochem. 138, 442-448.

10. On page 79, please amend Example 3, 35th reference, to read as follows:

Zhong, M., Munzer, J.S., Basak, A., Benjannet, S., Mowla, S.J., Decroly, E., Chretien, M.
and Seidah, N.G. (1999) J. Biol. Chem. 274:33913-33920.

No fees are believed necessary to enter this amendment.
However, if any fees are necessary, please charge Deposit Account
17-0055.

Respectfully submitted,
Nabil G. Seidah, et al.

October 18, 2001 By:

Jean C. Baker

QUARLES & BRADY LLP
411 East Wisconsin Avenue
Milwaukee, WI 53202
Reg. No.: 35,433
(414) 277-5709

MARKED PAGES SHOWING AMENDMENTS MADE



WO 00/26348 -2- PCT/CA99/01058 

It was further discovered by [Chang] Cheng, D. et al. (1999) J. Biol. Chem.
274:22805-22812 that an enzyme called S1P is capable of cleaving sterol-regulatory
element-binding proteins (SREBPs), which function to control lipid biosynthesis and uptake
in animal cells. Upon cleavage, SREBPs are released from cell membranes for translocation
to the nucleus, where they activate transcription of genes involved in the biosynthesis and
uptake of cholesterol and fatty acids. S1P and the present enzyme are the same. Therefore, for
diseases involving overexpression of these genes as well as any other disease involving SKI-1
activity, it is contemplated that any inhibitor of SKI-1 would be useful in their treatment.



WO 00/26348 -27- PCT/CA99/01058 

Genetic and biochemical evidence indicates that SKI-1/S1P is the protease that
cleaves sterol-regulatory element-binding proteins (SREBPs), which function to control lipid
biosynthesis and uptake in animal cells {Sakai, J. et al. (1998) Molecular Cell 2, 505-514;
Cheng, D. et al. (1999) J. Biol. Chem. 274, 22805-22812; Toure, A. et al. (1999) In: Peptides
for the New Millennium: Proceedings of the 16th American Peptide Symposium}. SKI-1 and
SREBPs play critical roles in the feedback pathways by which cholesterol suppresses
transcription of genes encoding HMG CoA reductase and other enzymes of cholesterol
biosynthesis as well as the low density lipoprotein (LDL) receptor. A SKI-1 inhibitor would
be of use under clinical conditions in which there is not sufficient down regulation of SREBP
dependent transcription by sterols. For example, in the Niemann-Pick group of diseases a high
sphingomyelin content of cells leads to an increase in proteolysis of SREBP-2 and a
subsequent increase in cholesterol biosynthesis {Scheek, S. et al. (1997) Proc. Natl. Acad.
Sci. USA 94, 11179-11183; Spence, M.W., and Callahan, J.W. (1989) Sphingomyelin-cholesterol
lipidoses: The Niemann-Pick Group of Diseases. In The Metabolic Basis of
Inherited Disease (Scriver, C.R., Beaudet, A.L., Sly, W.S., and Valle, D., editors),
McGraw-Hill Publ. Co., 6th edition, chapter 66, [1655-1675] 1655-1676; [Svirirodov] Sviridov, D.
(1999) Histology & Histopathology 14 (1): 305-319}. Perhaps of greater significance,
nuclear SREBP-1c protein levels were significantly elevated in mouse models for non-insulin
dependent diabetes, ob/ob and aP2 SREBP-1c mice, which was associated with elevated
mRNA levels for known SREBP target genes involved in the biosynthesis of fatty acids
(Shimomura, I. et al. J. Biol. Chem. 1999; 274:30028-30032).

Results of immunocytochemistry performed in mouse lacrimal glands provide evidence for
the presence of SKI-1 and APP in the same cell types, including intralobular duct epithelial
cells and some acinar cells (Fig. 26). The finding of SKI-1 in the lacrimal gland suggests the
possibility of developing a diagnostic assay analyzing tears; perhaps based on
two-dimensional polyacrylamide gel electrophoresis for disease diagnosis {Moley, M.P. et al.
(1997) Electrophoresis 18, 2811-2815; Glasson, M.J. et al. (1998) Electrophoresis 19,
852-855; Grus, F.H., and Augustin, A.J. (1999) Electrophoresis 20, 875-880; Iskeleli, G. et al.
(1999) [Electrophoresis 20, 875-880 }.] CLAO Journal 25:101-104;



WO 00/26348 - 72 - PCT/CA99/01058

Seidah, N.G., Mbikay, M., Marcinkiewicz, M., & Chretien, M. (1998) in Proteolytic
and Cellular Mechanisms in Prohormone and [Neuropeptide Precursor] Proprotein
Processing, ed. Hook, V.Y.H. (R.G. Landes Company, Georgetown, TX), pp. 49-76.

Ling, N., Burgus, R., & Guillemin, R. (1976) Proc. Natl. Acad. Sci. USA 73, [3042-3046]
3942-3946.



WO 00/26348 - 74 - PCT/CA99/01058 

10. Seidah, N.G., Mbikay, M., Marcinkiewicz, M. and Chretien, M., The 
mammalian precursor convertases: paralogs of the subtilisin/kexin family of calcium- 
dependent serine proteinases. In: Hook, V.Y.H. (Ed.), Proteolytic and Cellular Mechanisms 
in Prohormone and [Neuropeptide Precursor] Proprotein Processing. R.G. Landes Company, 
Georgetown, TX, USA, 1998, pp. 49-76. 



WO 00/26348 - 75 - PCT/CA99/01058 

13. Hallenberger, S., Moulard, M., Sordel, M., Klenk, H.D., and Garten, W. - The
role of eukaryotic subtilisin-like endoproteases for the activation of human
immunodeficiency virus glycoproteins in natural host cells. - Journal of Virology 1997;71:
[1036-1455] 1036-1045.



WO 00/26348 - 77 - PCT/CA99/01058 

Ling, N., Burgus, R., and Guillemin, R. (1976) Proc. Natl. Acad. Sci. USA 73, [3042-3046]
3942-3946.



WO 00/26348 - 78 - PCT/CA99/01058

[Rittenhose] Rittenhouse, J., and Marcus, F. (1984) Anal. Biochem. 138, 442-448



WO 00/26348 - 79 - PCT/CA99/01058

Zhong, M., Munzer, J.S., Basak, A., Benjannet, S., Mowla, S.J., Decroly, E.,
Chretien, M. and Seidah, N.G. (1999), J. Biol. Chem. [(in press)] 274:33913-33920.



13. The recombinant vector of claim 12, which comprises a promoter expressible in
a target cell wherein expression of said nucleic acid is desirable.

14. The recombinant vector of claim 12, which comprises an inducible promoter.

15. A recombinant host cell comprising the recombinant vector defined in any one of
claims 11 to 14.

16. A method of producing a proteic fragment of SKI-1 enzyme, which comprises the
steps of:

culturing a recombinant host cell expressing a nucleic acid as defined in any one
of claims 7 to 10 in a cell growth and expression-supportive culture medium; and
recovering said proteic fragment of SKI-1 in the culture medium.

17. A method for cleaving a substrate for SKI-1 enzyme, which comprises the step of:

a) contacting said substrate with a SKI-1 enzyme which has 1) an amino
acid sequence defined by amino acids 18 to 1052 of any one of SEQ ID Nos: 2, 4, 6 and
an active variant thereof, or 2) a SKI-1 soluble fragment as defined in claim 1, or 3) a
catalytic part of a) or b), or 4) a complex as defined in claim 3, for a time sufficient and in
conditions adequate for such cleavage to occur,

with the proviso that said substrate is not a sterol-regulatory element-binding protein
(SREBP).

18. A method for producing a protein or a peptide from a proteic precursor which is an
enzymatic substrate for SKI-1 enzyme, which comprises the steps of:

a) contacting said proteic precursor with a SKI-1 enzyme which has 1) an
amino acid sequence defined by amino acids 18 to 1052 of any one of SEQ ID Nos: 2,
4, 6 and an active variant thereof, or 2) a SKI-1 soluble fragment as defined in claim 1,
or 3) a catalytic part of a) or b), or 4) a complex as defined in claim 3, for a time sufficient
and in conditions adequate for such cleavage to occur; and

b) recovering said protein or peptide;

with the proviso that said substrate is not a sterol-regulatory element-binding protein
(SREBP).

19. The method of claim 17, which takes place in a cell or in the presence of a cellular
population and wherein step a) comprises the step of transfecting a cell with a nucleic acid
expressing said SKI-1 enzyme.

20. The method of claim 19, wherein said cell expresses said proteic precursor or is
transfected with a nucleic acid expressing said proteic precursor.

21. A method of inhibiting the activity of a subtilisin-kexin isoenzyme named SKI-1,
which comprises the step of contacting SKI-1 with the inhibitor defined in any one of claims
4 to 6, 8 and 10.

22. A peptide of at least 7 amino acids capable of binding to and of being cleaved by
SKI-1 catalytic site, comprising the following general formula:

Arg Xaa1 J Xaa2 ↓ Xaa3 (Z)n O

wherein Xaa1, 2, 3 and Z are any amino acid
J is an alkyl or aromatic hydrophobic amino acid
n is 1, 2 or 3
O is an acidic amino acid,

with the proviso that the peptide does not comprise the sequence Lys
- Arg - Phe - Val - Phe - Asn - Lys - Ile - Glu.

23. A peptide as defined in claim 22, wherein Xaa2 is Lys, Leu, Phe or Thr.

24. A peptide as defined in claim 23 which has the sequence:

H2N-Val-Phe-Arg-Ser-Leu-Lys-Tyr-Ala-Glu-Ser-Asp-COOH.

25. A peptide as defined in any one of claims 22 to 24 which is labelled.

26. A peptide as defined in claim 25 which is fluorogenic.

27. A peptide as defined in claim 26 which is

Abz-Val-Phe-Arg-Ser-Leu-Lys-Tyr-Ala-Glu-Ser-Asp-Tyr(NO2),

wherein
Abz is orthoaminobenzoic acid, and
Tyr(NO2) is 3-nitrotyrosine.



28. The use of a peptide as defined in any one of claims 22 to 27 for monitoring the
activity of a subtilisin-kexin isoenzyme named SKI-1.

29. The use as defined in any one of claims 22 to 27 for screening inhibitors of a
subtilisin-kexin isoenzyme named SKI-1.

30. The use as defined in any one of claims 22 to 27 for screening a subtilisin-kexin
isoenzyme named SKI-1.

31. The use of a peptide of at least 7 amino acids capable of binding to and of being
cleaved by SKI-1 catalytic site, comprising the following general formula:

Arg Xaa1 J Xaa2 ↓ Xaa3 (Z)n O

wherein
Xaa1, 2, 3 and Z are any amino acid
J is an alkyl or aromatic hydrophobic amino acid
n is 1, 2 or 3
O is an acidic amino acid

for monitoring the activity of a subtilisin-kexin isoenzyme named SKI-1.

32. The use of a peptide of at least 7 amino acids capable of binding to and of being
cleaved by SKI-1 catalytic site, comprising the following general formula:

Arg Xaa1 J Xaa2 ↓ Xaa3 (Z)n O

wherein
Xaa1, 2, 3 and Z are any amino acid
J is an alkyl or aromatic hydrophobic amino acid
n is 1, 2 or 3
O is an acidic amino acid

for screening inhibitors or substrates of a subtilisin-kexin isoenzyme named SKI-1.

33. The use of an inhibitor of the activity of a subtilisin-kexin isoenzyme named SKI-1
in the making of a medication for treating a disease involving an overexpression of a SKI-1
or a SKI-1 substrate.

34. The use as defined in claim 33, wherein said disease is associated with any one
of hypercholesterolemia, high levels of fatty acids, lipids or farnesyl pyrophosphate, liver
steatosis, Ras-dependent cancer, restenosis and amyloid protein formation.

35. The use as defined in claim 33 or 34, wherein said inhibitor is defined in any one
of claims 2, 4 to 6, 8 and 10.

36. A composition comprising a SKI-1 fragment as defined in any one of claims 1 to
6, or a nucleic acid defined in any one of claims 7 to 10, or a recombinant vector as
defined in any one of claims 7 to 10, or a recombinant vector as defined in any one of
claims 11 to 14.

37. The use of a SKI-1 enzyme as encoded by nucleic acids 18 to 1052 of SEQ ID
NOs: 1, 3 or 5, or of a catalytic part that is unique to SKI-1 enzyme, or of an active variant
thereof, the nucleic acid of the variant sharing at least 70% homology with the nucleic acid
defined in SEQ ID NOs.: 1, 3 and 5 and hybridizing therewith under stringent hybridization
conditions, for cleaving a proteic precursor, with the proviso that said proteic precursor
is not a sterol-regulatory element-binding protein (SREBP).

TITLE OF THE INVENTION:

Mammalian subtilisin/kexin isozyme SKI-1: a proprotein convertase with a unique
cleavage specificity

FIELD OF THE INVENTION:

This invention relates to a serine proteinase capable of converting proteic
precursors into mature proteins; particularly a serine proteinase capable of cleaving
at non-basic amino acid residues.



BACKGROUND OF THE INVENTION:

Limited proteolysis of inactive precursors to produce active peptides and
proteins is an ancient mechanism to generate biologically diverse products from a finite
set of genes. Most often, such processing occurs at either single or dibasic residues,
as a result of cleavage by a family of mammalian serine proteinases related to
bacterial subtilisin and yeast kexin (1, 2). These enzymes, known as pro-protein
convertases (PCs), participate in the tissue-specific intracellular processing of
precursors at the consensus (R/K)-(X)n-R↓ sequence, where X is any amino acid
except Cys and n = 0, 2, 4 or 6 (1-3). PCs have been implicated in the production of
various bioactive polypeptide hormones, neuropeptides, enzymes, growth factors,
adhesion molecules, cell surface receptors and surface glycoproteins of infectious
agents such as viruses and bacteria (1-3).
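
As an illustration only, the consensus just described can be written as a pattern over one-letter amino acid codes. The sketch below is not part of the disclosed methods: the function name `find_pc_sites` and the restriction to the 20 standard residues are assumptions made for the example, and a match shows only that a sequence fits the motif, not that a convertase actually cleaves it.

```python
import re

# X: any of the 20 standard amino acids except Cys (one-letter codes).
X = "[ADEFGHIKLMNPQRSTVWY]"

# One zero-width lookahead per allowed spacer length n, so that
# overlapping occurrences of the motif are all reported.
PATTERNS = {n: re.compile(f"(?=[RK]{X}{{{n}}}R)") for n in (0, 2, 4, 6)}


def find_pc_sites(seq):
    """Return sorted 0-based indices i such that seq[:i] ends with (R/K)-(X)n-R.

    The index marks the position of the cleavage arrow, i.e. the bond
    immediately after the final Arg of the motif.
    """
    sites = set()
    for n, pattern in PATTERNS.items():
        for match in pattern.finditer(seq.upper()):
            # Motif length: 1 (R or K) + n (X residues) + 1 (final R).
            sites.add(match.start() + n + 2)
    return sorted(sites)


if __name__ == "__main__":
    for example in ("AAKRAA", "RSKR", "RCCR"):
        print(example, find_pc_sites(example))
```

A Cys residue at any X position, as in the last example, prevents a match, which mirrors the "any amino acid except Cys" condition.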

Less commonly, bioactive products can also be produced by limited proteolysis
at amino acids such as Leu, Val, Met, Ala, Thr, Ser and combinations thereof (3). This
type of cellular processing has been implicated in the generation of bioactive peptides
such as α- and γ-endorphin (4), the C-terminal glycopeptide fragment 1-19 of pro-vasopressin
(5), anti-angiogenic polypeptides such as platelet factor 4 (6) and
angiostatin (7), the metalloprotease ADAM-10 (8), site 1 cleavage of the sterol regulatory
element binding proteins (9), as well as in the production of the Alzheimer's
amyloidogenic peptides Aβ40, 42 and 43 (10). Processing of this type occurs in the
endoplasmic reticulum (ER) (9), or late along the secretory pathway, within secretory
granules (4, 5), at the cell surface, or in endosomes (6-8, 10). So far, the proteinases
responsible for these cleavages have not been unambiguously identified.

Since mammalian convertases process precursors at either single or pairs of
basic residues, we hypothesised that a distinct, but related, enzyme(s) may generate
polypeptides by cleavage at non-basic residues. To test that idea, we employed an RT-PCR
strategy similar to the one used to identify the PCs (11), except that we used
degenerate oligonucleotides closer to bacterial subtilisin than to yeast kexin. This
approach resulted in the isolation of a cDNA fragment encoding a putative subtilisin-like
enzyme from human cell lines. This partial sequence was identical to a segment
of a human myeloid cells-derived cDNA reported by Nagase et al. (12). A role for this
putative subtilase remained undefined up to the present invention.

It was further discovered by Cheng, D. et al. (1999) J. Biol. Chem. 274:22805-22812
that an enzyme called S1P is capable of cleaving sterol-regulatory element-binding
proteins (SREBPs), which function to control lipid biosynthesis and uptake in
animal cells. Upon cleavage, SREBPs are released from cell membranes for
translocation to the nucleus, where they activate transcription of genes involved in the
biosynthesis and uptake of cholesterol and fatty acids. S1P and the present enzyme
are the same. Therefore, for diseases involving overexpression of these genes as well
as any other disease involving SKI-1 activity, it is contemplated that any inhibitor of
SKI-1 would be useful in their treatment.

SUMMARY OF INVENTION: 

We show that the sequences of the rat, mouse and human orthologues of this
putative type-I membrane-bound subtilisin-kexin-isoenzyme, which we called SKI-1,
exhibit a high degree of sequence conservation. Tissue distribution analysis by both
Northern blots and in situ hybridization (ISH) revealed that SKI-1 mRNA is widely
expressed. A stable transfectant of human SKI-1 in HK293 cells allowed the analysis
of its biosynthesis and intracellular localization. We present data demonstrating that
SKI-1 cleaves at a specific Thr↓ residue within the N-terminal segment of human pro-brain-derived
neurotrophic factor (proBDNF). SKI-1 is the first identified secretory
mammalian subtilisin/kexin-like enzyme capable of cleaving a proprotein at non-basic
residues.

Therefore in accordance with the present invention, there is provided a soluble
proteic fragment of a subtilisin-kexin isoenzyme named SKI-1 which has the amino
acid sequence defined by amino acids 187 to 996 of any one of SEQ ID NOs: 2, 4 and
6, a variant thereof, or an enzymatically active part thereof.

It is further an object of this invention to provide a proteic fragment of SKI-1
enzyme, which has the amino acid sequence defined by amino acids 18 to 137 of any
one of SEQ ID NOs: 2, 4 and 6, a variant thereof, or a part thereof, which is a pro-segment
capable of binding with amino acids 18 to 1052 of SKI-1 in whole or in part.

A part of this pro-segment has a molecular weight of about 14 kDa and forms
a tight complex with the soluble fragment of SKI-1.

The pro-segment is an inhibitor of SKI-1 activity.

To improve its inhibitory activity, the pro-segment sequence is modified to
prevent further enzymatic processing in a cell expressing said proteic fragment.

The modification includes amino acid substitution, deletion or rearrangement.

Nucleic acids encoding any of the above SKI-1 forms are also objects of this invention.

Recombinant vectors and hosts comprising these nucleic acids are also objects
of this invention.

The recombinant vectors are preferably expression vectors.

The recombinant vectors comprise a promoter expressible in a target cell
wherein expression of said nucleic acid is desirable, be it for therapeutic or
manufacturing purposes.
The recombinant vectors may also comprise an inducible promoter. 

It is further an object of this invention to provide a method of producing a proteic
fragment of SKI-1 enzyme, which comprises the steps of:

culturing a recombinant host cell expressing a SKI-1 nucleic acid in a cell
growth and expression-supportive culture medium; and recovering the proteic fragment
of SKI-1 in the culture medium.

There is also provided a method for cleaving a proteic precursor which is an
enzymatic substrate for SKI-1 enzyme, which comprises the step of:

a) contacting the proteic precursor with a SKI-1 enzyme which has an amino
acid sequence defined by amino acids 18 to 1052 of SEQ ID Nos: 2, 4 or 6, or
a variant thereof, or the soluble form, for a time sufficient and in conditions
adequate for such cleavage to occur.

The cleavage may be provoked in vivo or in vitro, e.g. serving a therapeutic
purpose or an industrial protein manufacturing use.

For the purpose of producing a protein or a peptide from a proteic precursor
which is an enzymatic substrate for SKI-1 enzyme, the method would further comprise
the step of:

b) recovering and purifying the protein or peptide.

The method may be performed in cell-free assays, or may take place in a cell 
or in the presence of a cellular population, and wherein step a) comprises the step of 
transfecting a cell with a nucleic acid expressing a SKI-1 protein. 

The cell may express said proteic precursor or may be transfected with a 
nucleic acid expressing the proteic precursor. 

A method of silencing the expression or the activity of SKI-1 enzyme on a 
proteic precursor, which comprises the steps of: 

contacting the enzyme or a nucleic acid encoding the enzyme with a ligand 
molecule which binds to the enzyme or to the nucleic acid, thereby interfering with the 
binding of the enzyme to the proteic precursor or with the expression of the nucleic 
acid encoding the enzyme, is also an object of this invention. 

The ligand molecule may comprise an antisense nucleic acid to the nucleic acid
encoding SKI-1, a pro-segment of a precursor protein encoding SKI-1, a SKI-inhibitor,
a peptide mimicking a proteic precursor SKI-1 binding site, or an antibody molecule
directed against SKI-1, or one which generates an inactive SKI-1 mutant form.

The pro-segment is a polypeptide extending from amino acids 17 to 137 of SEQ
ID NOs: 2, 4, 6, or a variant thereof or an inhibitory part thereof.

We also provide a peptide of at least 7 amino acids capable of binding to and 
of being cleaved by SKI-1 catalytic active site, comprising the following general 
formula: 

Arg Xaa1 J Xaa2 ↓ Xaa3 (Z)n O

wherein Xaa1, 2, 3 and Z are any amino acid
J is an alkyl or aromatic hydrophobic amino acid
n is 1, 2 or 3
O is an acidic amino acid.

Preferably Xaa2 is Lys, Leu, Phe or Thr.

A preferred peptide has the structure:

H2N-Val-Phe-Arg-Ser-Leu-Lys-Tyr-Ala-Glu-Ser-Asp-COOH.
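
As an illustration only, the general formula can be checked against a candidate sequence with a short pattern match. The sketch below is hypothetical and not part of the invention as described: the function name `find_ski1_motif_sites` is invented for the example, and the residue sets chosen for J (Ala, Val, Leu, Ile, Phe, Tyr, Trp) and for O (Asp, Glu) are assumptions, since the text does not enumerate them.

```python
import re

ANY = "[ACDEFGHIKLMNPQRSTVWY]"  # any standard amino acid (Xaa1, Xaa2, Xaa3, Z)
J = "[AVLIFYW]"                 # ASSUMED alkyl or aromatic hydrophobic residues
ACIDIC = "[DE]"                 # ASSUMED acidic residues (Asp, Glu)

# Arg Xaa1 J Xaa2 | Xaa3 (Z)n O with n = 1, 2 or 3, written as a zero-width
# lookahead so that overlapping occurrences are all reported.
MOTIF = re.compile(f"(?=R{ANY}{J}{ANY}{ANY}{ANY}{{1,3}}{ACIDIC})")


def find_ski1_motif_sites(seq):
    """Return 0-based indices of the bond between Xaa2 and Xaa3 for each match."""
    # Arg, Xaa1, J and Xaa2 occupy four positions before the marked bond.
    return [m.start() + 4 for m in MOTIF.finditer(seq.upper())]


if __name__ == "__main__":
    peptide = "VFRSLKYAESD"  # the preferred peptide in one-letter code
    for site in find_ski1_motif_sites(peptide):
        print(peptide[:site], "|", peptide[site:])
```

For the preferred peptide the pattern places the marked bond between Lys and Tyr. The shortest sequence that can match has seven residues, consistent with the stated minimum length.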
The peptide may be labelled, a fluorogenic label being one of our preferred 
embodiments. 

A fluorogenic peptide which has the following sequence: 

Abz-Val-Phe-Arg-Ser-Leu-Lys-Tyr-Ala-Glu-Ser-Asp-Tyr(NO2) 
has been synthesized. 

These peptides can be used for monitoring SKI-1 activity, for screening 
inhibitors of SKI-1 activity or for screening enhancers of SKI-1 activity. 

An inhibitor of SKI-1 activity, namely the pro-segment, modified or not, used in 
the making of a medication for treating a disease involving an overexpression of 
SKI-1 or of a SKI-1 substrate, is also a further object of this invention. 

The disease may be associated with any one of hypercholesterolemia, high 
levels of fatty acids, lipids or farnesyl pyrophosphate, liver steatosis, Ras-dependent 
cancer, restenosis and amyloid protein formation. 

We also provide a method for detecting SKI-1 in a sample, which 
comprises the steps of contacting the sample with a ligand molecule to SKI-1 protein 
or nucleic acid, and detecting the formation of a complex between said ligand and SKI-1 
protein or nucleic acid as an indication of the presence of SKI-1 in said sample. The 
ligand includes molecules such as anti-SKI-1 antibodies or nucleic acid probes or 
primers. 

Finally, a new use is provided for SKI-1 enzyme, in whole or in part, which is for 
cleaving substrates not cleaved by other members of the subtilisin-kexin family. 
Variants of SKI-1 are within the scope of this invention; such variants are encoded by 
nucleic acids sharing at least 70% homology with the sequences defined in SEQ ID 
NOs: 1, 3, 5. 

DESCRIPTION OF THE INVENTION: 

During our search for new members of the subtilisin-kexin family, we obtained 
two closely related sequences from mouse and rat tissues. When querying gene 
data banks to find a match with other known sequences, we found that the human 
counterpart had been previously cloned and sequenced. However, no specific function 
for this enzyme was known. We named our new enzyme subtilisin-kexin isoenzyme 
1 (SKI-1). 

We characterized this enzyme and found that SKI-1 has a unique cleavage site 
in cognate substrates. One of these substrates is pro-BDNF. Sakai et al. have found 
that another substrate, SREBP-2, which is a sterol-responsive transcription element, 
was cleaved at a first enzyme processing site by an enzyme which they called site 1 
protease (S1p). S1p and SKI-1 appeared to be the same enzyme. 



WO 00/26348 






PCT/CA99/01058 



Since SKI-1 is autocatalytically cleaved, this brings to three the number of 
substrates that are known to be recognized and cleaved by SKI-1. One object of this 
invention is therefore the use of SKI-1 as a protein processing enzyme. 

SKI-1 is ubiquitously distributed and appears to be very well conserved 
amongst mammalian species. Therefore, variants of SKI-1 are within the scope of this 
invention. We have indeed identified two species variants of the human enzyme 
disclosed in gene data banks, which is in itself proof that variants having SKI-1 
activity exist. 

SKI-1 is first located in the endoplasmic reticulum (ER) membrane. Upon 
processing, the pro-segment of pro-SKI-1 is removed and SKI-1 is thus activated. SKI-1 
is further processed to remove the transmembrane domain that keeps it integrated in 
the ER membrane, which generates a soluble SKI-1 form that is directed into the 
secretory pathway and remains active. The soluble active form is indeed 
retrievable in culture media, as is the pro-segment. The pro-segment is itself also 
processed into shorter fragments. One of these fragments has an apparent molecular 
weight of about 14 kDa and forms a tight complex with the soluble SKI-1 form. The 
formation of this complex does not hinder the activity of the enzyme. It is known that 
the pro-segment of pro-protein convertases is inhibitory in vitro to the activity of the 
convertases. We demonstrate for the first time hereinbelow that such a behaviour 
occurs in an ex vivo model. SKI-1 pro-segment also has such an inhibitory activity. We 
predict that a SKI-1 pro-segment modified to prevent pro-segment 
processing will be an even better SKI-1 inhibitor. Such a modification is made by 
converting an enzyme recognition and cleavage site into a non-cleavable sequence. 
Such modification is intended to cover amino acid substitutions, deletions or 
rearrangements to provide a SKI-1 pro-fragment that has an improved inhibitory activity. 

The nucleic acids encoding all the above SKI-1 forms (soluble, pro-segment 
and sub-fragments, modified or not) are within the scope of this invention. 
Recombinant vectors and hosts comprising these nucleic acids are also objects of this 
invention. More particularly, expression vectors capable of producing the different 
SKI-1 forms are preferred. The expression vectors comprise promoter sequences which 
govern the expression of the nucleic acids. The promoter may be compatible with the 
cell wherein the expression of the nucleic acid is sought, be it for a therapeutic purpose 
or for the industrial production of SKI-1. The promoter may also be an inducible 
promoter which needs an exogenous inducing agent to activate the expression. For 
the production of any SKI-1 form, a recombinant host cell may be used and is cultured 
in a culture medium which supports cell proliferation and the expression of the nucleic 
acids. Under suitable conditions, the SKI-1 form of interest is expressed and may be 
conveniently recovered from the culture medium. 

A general method for cleaving a proteic precursor is also an object of this 
invention. SKI-1 whole active enzyme or its soluble form or catalytically active 
fragments or variants are added to a proteic precursor which is a SKI-1 substrate, in 
conditions adequate for enzymatic precursor processing (cleavage) to occur. This 
method may be performed in vivo for curing a SKI-1 deficiency or in vitro for the 
industrial preparation of active proteins. In the latter case, the processing may be 
performed in a cell-free assay, using purified proteic precursors and SKI-1 whole 
enzyme or derived forms. Alternatively, it may be performed using transfected cells 
expressing SKI-1 whole enzyme and derived forms. The transfected cells may 
endogenously express the protein precursor or may be co-transfected to express the 
same. The transformed cells therefore become a source of mature proteins 
and/or SKI-1. 

Modification of the SKI-1 activity is further an object of this invention. We have 
succeeded in inhibiting SKI-1 activity using the SKI-1 pro-segment. Alternative ways 
to achieve the same results include antisense nucleic acids or oligonucleotides, SKI-1 
inhibitors, peptides mimicking a precursor SKI-1 binding site (cleavable or not), which 
would compete for the binding of SKI-1 to its cognate protein precursor site, and 
antibodies directed against SKI-1 or its cognate proteic precursor binding site. Another 
alternative is a gene therapy replacing the active SKI-1 with an inactive mutant form. 
Conversely, overexpressing SKI-1 may cure a SKI-1 deficiency. Due to the ubiquitous 
distribution of SKI-1, it may be useful, even necessary, to target the cell wherein SKI-1 
activity is to be modified for such a therapeutic purpose. Such targeting may include 
conjugating or combining molecules capable of modifying or modulating SKI-1 activity 
to a ligand capable of targeting the cell of interest. Immunoliposomes are examples of 
targeting vehicles, as are conjugated ligands-oligonucleotides. Even viral vectors 
may be made targeting if they express such a targeting ligand at the membrane 
surface. A targeting ligand serves a selection purpose, leaving the 
non-targeted cells substantially intact. 

Peptides of less than 100 amino acids, more preferably of less than 30 amino 
acids, mimicking a cognate SKI-1 cleaving site in a proteic precursor have been 
synthesized and are also objects of this invention. Therefore, a peptide of at least 7 
amino acids comprising the following preferred structure is capable of binding to and 
of being cleaved by SKI-1 enzyme catalytic site: 

Arg Xaa1 J Xaa2 ↓ Xaa3 (Z)n O 

wherein Xaa1, Xaa2, Xaa3 and Z are any amino acid; 
J is an alkyl or aromatic hydrophobic amino acid; 
n is 1, 2 or 3; and 
O is an acidic amino acid. 

Preferably Xaa 2 is Lys, Leu, Phe or Thr. 

The preferred peptide has the following sequence: 

H2N-Val-Phe-Arg-Ser-Leu-Lys↓Tyr-Ala-Glu-Ser-Asp-COOH. 

These peptides may be labelled in such a way that labelled fragments produced 
upon cleavage are easily detected and identified. Such labelling includes any type of 
suitable detectable marker. We have developed a fluorogenic peptide which shows 
a very good affinity for SKI-1. The above preferred peptide has been labelled at its N- 
and C-terminal ends with orthoaminobenzoic acid and 3-nitrotyrosine groups, 
respectively. 
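
By way of illustration only, the general formula above can be written as a pattern and checked against the preferred peptide. The Python sketch below is not part of the disclosure: the one-letter codes, the residue set taken to be "alkyl or aromatic hydrophobic" (A, V, L, I, M, F, W, Y) and the function name `find_cleavage_sites` are assumptions made for the example.

```python
import re

# Assumptions for illustration (not specified in the text):
AA = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids, one-letter codes
HYDROPHOBIC = "AVLIMFWY"      # assumed reading of "alkyl or aromatic hydrophobic"
ACIDIC = "DE"                 # Asp, Glu

# Arg Xaa1 J Xaa2 | Xaa3 (Z)n O, with n = 1, 2 or 3.
# A lookahead is used so that overlapping candidate sites are all reported.
MOTIF = re.compile(
    rf"(?=(R[{AA}][{HYDROPHOBIC}][{AA}])([{AA}][{AA}]{{1,3}}[{ACIDIC}]))"
)

def find_cleavage_sites(seq):
    """Return 0-based positions p such that the predicted scissile bond
    lies between seq[p-1] (Xaa2) and seq[p] (Xaa3)."""
    return [m.start() + 4 for m in MOTIF.finditer(seq.upper())]

# Preferred peptide Val-Phe-Arg-Ser-Leu-Lys-Tyr-Ala-Glu-Ser-Asp
preferred = "VFRSLKYAESD"
sites = find_cleavage_sites(preferred)
for p in sites:
    print(preferred[:p], "|", preferred[p:])
```

A pattern match of this kind only indicates that a sequence fits the general formula; whether a given peptide is actually bound and cleaved by SKI-1 remains an experimental question.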

These peptides as well as cell lines expressing SKI-1 will be especially useful 
for monitoring SKI-1 activity and for screening inhibitors or substrates and enhancers 
of SKI-1 activity. 

Inhibitors of SKI-1, namely the SKI-1 pro-segment, will be used in the making 
of a medication for treating a disease involving overexpression of SKI-1 or of its 
substrate. 

Conversely, substrates of SKI-1 will be used in the research field to discover 
physiological systems involving SKI-1. 

Diagnostic methods and kits comprising a ligand to SKI-1 protein or nucleic 
acid, which is to be contacted with a sample suspected to express SKI-1, are also 
objects of this invention. Detection of the formation of a ligand-SKI-1 complex or of a 
hybridization complex is an indication of the presence or amount of SKI-1 in the 
sample. 

Since we were the first to discover the function of SKI-1 enzyme, the use 
thereof for cleaving proteic precursors that are not substrates for the other members 
of the subtilisin-kexin family is an object of this invention. SKI-1 is intended in this 
broad use to cover the whole enzyme, a catalytic part thereof and its functional 
variants. Variants are encoded by any one of the nucleic acids depicted in SEQ ID NOs: 
1, 3 or 5, and any other sequences sharing at least 70% homology therewith, 
preferably more than 85% homology, under stringent conditions of hybridization. 
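
As a rough illustration of how the 70% and 85% thresholds might be screened computationally, the sketch below computes percent identity over two sequences that have already been aligned. This is an assumption-laden stand-in: the text defines homology in terms of hybridization under stringent conditions, not a particular identity calculation, and the function name, the gap character and the toy sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over the ungapped columns of two pre-aligned sequences.

    Both inputs must have the same length (i.e. come from an existing
    pairwise alignment). Columns where either sequence has a gap are skipped.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    columns = [(x, y) for x, y in zip(aligned_a.upper(), aligned_b.upper())
               if x != gap and y != gap]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)

# Toy example with made-up sequences (not taken from SEQ ID NOs: 1, 3 or 5)
identity = percent_identity("ATGCATGCAT", "ATGCATGGAT")
print(f"{identity:.1f}% identity; meets 70%: {identity >= 70}; meets 85%: {identity >= 85}")
```

In practice the alignment itself would come from a dedicated alignment tool, and the result would complement, not replace, a hybridization experiment.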

Having now defined the general teachings of the present invention, reference 
will be made hereinbelow to specific examples and embodiments as well as to the 
following appended figures, the purpose of which is to illustrate the invention rather than to 
limit its scope. 

BRIEF DESCRIPTION OF FIGURES: 

FIG. 1 shows the comparative protein sequences of SKI-1 deduced from rat, mouse 
and human cDNAs (SEQ ID NOs: 2, 4, and 6, encoded by nucleic acids SEQ ID NOs: 
1, 3, and 5, respectively). The position of the predicted end of the 17 aa signal peptide 
is shown by an arrow. The active sites Asp218, His249 and Ser414, as well as the oxyanion 
hole Asn338, are in bold, shaded and underlined characters. The positions of the 6 
potential N-glycosylation sites are emphasized in bold. The conserved shaded 
CLDDSHRQKDCFW sequence fits the consensus signature for growth factors and 
cytokine receptors family. Each of the two boxed sequences was absent in a number 
of rat clones. The predicted transmembrane segment is in bold and underlined. 

FIG. 2 shows a Northern blot analysis of the expression of SKI-1 in adult rat tissues. 
[A] 5 µg of male rat total RNA were loaded in each lane. Molecular sizes are based on 
the migration of an RNA ladder. The tissues include: adrenal, thyroid, striatum, 
hippocampus, hypothalamus, pineal gland, anterior (AP) and neurointermediate (NIL) 
lobes of the pituitary, submaxillary gland, prostate, ovary and uterus. Notice the high 
level of SKI-1 mRNA in adrenal glands. [B] 2 µg of poly-A+ RNA of (male + female) Sprague 
Dawley rat adult tissues (Bio/Can Scientific) were loaded, which includes: liver, 
thymus, spleen, kidney, heart and brain. The estimated size of rat SKI-1 mRNA is 
about 3.9 kb. 

FIG. 3 shows in situ hybridization (ISH) of rSKI-1 mRNA in a 2 day old rat. ISH is 
shown at anatomical resolution on X-ray film using an [35S]-labeled antisense riboprobe 
[A-C] and sense control riboprobe [D]. Abbreviations: Adr - adrenal gland; Cb - 
cerebellum; cc - corpus callosum; Cx - cerebral cortex; H - heart; Int - intestine; K - 
kidney; Li - liver; Lu - lungs; M - muscles; Mol - molars; OT - olfactory turbinates; Pit - 
pituitary gland; Rb - ribs; Ret - retina; Sk - skin; SM - submaxillary gland; Th - thymus. 
Magnification x 4; scale bar (in D) = 1 cm. 

FIG. 4 illustrates the biosynthetic analysis of SKI-1 in HK293 cells. Stable transfectants 
expressing either the pcDNA3 vector alone or one that expresses SKI-1 (clone 9) were 
pulse-labeled for 4h with [35S]Met. Media and cell lysates were immunoprecipitated with 
either a SKI-1 antiserum (Ab: SKI; against aa 634-651) or a pro-SKI-1 antiserum (Pro). 
The stars represent the 4 specific intracellular proteins (Mr 148, 120, 106 and 98 kDa) 
immunoprecipitated with the SKI-1 antiserum. In these transfected cells, only the 148 
kDa band is recognized by the Pro-antiserum. A 98 kDa immunoreactive SKI-1 s 
protein is also detectable in the medium. 

FIG. 5 shows hSKI-1 immunoreactivity in stably transfected HK293 cells. 
Representation of the comparative double fluorescence staining using a SKI-1 
antiserum (directed against aa 634-651) [A] and [B] and FITC-labeled WGA [A'] and 
[B'] in control [A, A'] and LME-treated [B, B'] cells is shown. Thin arrows emphasize the 
observed punctate staining which is enhanced in the presence of LME. Large arrows 
point to the coincident staining of SKI-1 and WGA. Magnification x 900; bar (in B') = 
10 µm. 

FIG. 6 shows the processing of proBDNF by SKI-1. [A] COS-7 cells were infected with 
vv:BDNF and either vv:WT (-) or vv:SKI-1 in the presence of either vv:PIT or vv:PDX. 
The cells were metabolically labeled with [35S]Cys-Met for 4h and the media (M) and 
cell lysates (C) were immunoprecipitated with a BDNF antiserum, prior to SDS-PAGE 
analysis. The autoradiogram shows the migration positions of proBDNF (32 kDa), the 
28 kDa BDNF produced by SKI-1 and the 14 kDa BDNF. [B] Microsequence analysis 
of the [35S]Met-labeled 32 kDa proBDNF (maximal scale 1000 cpm) and [3H]Leu-labeled 
28 kDa BDNF (maximal scale 250 cpm), revealing a Met at sequence position 
3 and Leu at positions 2, 13 and 14, respectively. 






FIG. 7 shows the in vitro processing profile of proBDNF by SKI-1. [A] pH dependence 
of the processing of proBDNF by SKI-1. The SKI-1 enzyme preparation was compared 
to that obtained from the media of Schwann cells infected with the wild type virus (WT) 
as control. [B] Inhibitor profile of the processing of proBDNF to the 28 kDa BDNF by 
the same SKI-1 preparation as in [A]. The reaction was performed overnight at 37°C, 
pH 6.0. Notice that only PMSF (0.5 mM PMSF + 50 µM pAPMSF), o-phenanthroline (5 
mM), and EDTA (10 mM) effectively inhibited SKI-1 cleavage of proBDNF. 

FIG. 8 shows the in situ hybridization revealing SKI-1 mRNA expression in the 
pituitary gland of an adult rat using specific [35S]radiolabeled antisense (SKI AS) and 
control sense (SKI SS) riboprobes. The hybridization signal was detected in the 
anterior (AL), intermediate (IL) and posterior pituitary lobe (PL). Most of the labeling 
was confined to endocrine cells in AL and IL and to some pituicytes in the PL. 
Magnification x 5; bar (in b) = 1 mm. 

FIG. 9 shows the in situ hybridization revealing the sites of SKI-1 mRNA expression 
in the skin of a two-day-old (p2) newborn rat using antisense (SKI AS) and control 
sense (SKI SS) riboprobes. The hybridization signal was detected in the stratum 
germinativum (small vertical arrows in SGe), in both outer and inner hair sheath 
(medium arrows) and in some cells within the dermis (D). Other abbreviations: HB - 
hair bulb, SC - stratum corneum, SGr - stratum granulosum. Magnification x 80. 

Fig. 10 shows the in situ hybridization (ISH) distribution of SKI-1 mRNA in the rat 
central nervous system (CNS). The ISH distribution pattern in the CNS of adult rat 
demonstrates a higher concentration of SKI-1 mRNA within the grey matter (GM and all 
structures indicated with capital letters) vs the white matter (WM) including corpus 
callosum (cc). Representative brain structures are shown in sagittal (a), horizontal (b) 
and coronal planes (c - f) after hybridization with antisense SKI-1 riboprobe (a - e) and 
control sense riboprobe (ssRNA in f). As shown at the anatomical level, this type of mRNA 
distribution is highly reminiscent of a pan-neuronal gene distribution pattern. 
Complementing this figure, Table 1 demonstrates at the cellular level the predominance 
of neuronal SKI-1 mRNA expression over glial SKI-1 mRNA expression. Magnification 
x 4; bar (in a) = 1 cm. Abbreviations: CA1 - area 1 of cornu Ammonis; CA3 - area 3 
of cornu Ammonis; Cb - cerebellum; cc - corpus callosum; Ch Pl - choroid plexus; Cx 
- cerebral cortex; GD - gyrus dentatus; GM - grey matter; Hip - hippocampus; Hy - 
hypothalamus; Ol - olfactory bulb; Str - striatum; WM - white matter. 

Fig. 11 shows the in situ hybridization (ISH) distribution of SKI-1 mRNA in the rat 
peripheral nervous system (PNS) trigeminal ganglion (TriG). ISH distribution pattern 
in the CNS of adult rat demonstrates a higher concentration of SKI-1 mRNA within a 
region of cell bodies (large arrows) over the region of supportive Schwann cells (small 
arrows). ISH was performed using antisense (SKI-1 as, in a) and sense (SKI-1 ss) 
riboprobes. Magnification x 12. 

Fig. 12 shows the distribution of SKI-1, mRNA and/or protein, in the region of spinal 
cord (SpC) and in the related dorsal root ganglion (DRG) and dorsal root (DR). 
Demonstrated are the region of neuronal cell bodies in the DRG (SKI-1 mRNA) and 
the region of nerve terminals in the dorsal horn of the spinal cord (layer I and II) 
characterized by an especially high density of SKI-1 protein. 

A) Schematic drawing depicting the position of layer I and II in the dorsal horn as 
well as that of the related DRG and DR. 

B) SKI-1 mRNA revealed by in situ hybridization labeling (thin arrows) in the DRG 
using antisense riboprobes (SKI-1 AS). 

C) Control hybridization in the DRG using sense riboprobes (SKI-1 SS). 

D) Immunocytochemical localization of SKI-1 (brown staining) within layer I and II 
of the dorsal horn and in the dorsal root (DR) suggesting the sensory afferents arriving 
from DRG. Neuronal and glial nuclei are stained in blue. Magnification x 300. 

E) Immunoreactivity of SKI-1 (thin arrows) detected around neuronal somata (large 
arrows) within layer II of the dorsal horn at high magnification (x 1,500). The pattern of 
immunoreactive spots is reminiscent of that of axo-somatic or axo-dendritic nerve 
terminals. 

F) Northern blot revealing the concentrations of 4 kb SKI-1 mRNA in different 
tissues including dorsal root ganglia (DRG) and spinal cord (SpC). Abbreviations: I - 
layer I of the dorsal horn; II - layer II of the dorsal horn; Adr - adrenal gland; Cb - 
cerebellum; Cx - cerebral cortex; Hip - hippocampus; DH - dorsal horn; DR - dorsal root; 
DRG - dorsal root ganglion; SpC - spinal cord; Stom - stomach and Thyr - thyroid 
gland. 

Fig. 13 shows the farnesyl diphosphatase mRNA levels in HK 293 cells treated with 
(+) lipids (cholesterol and 25-hydroxycholesterol) or without lipids (-). 1-2 = wild type 
cells, 3-4 = SREBP-1 overexpressors, 5-6 = a pool of 3 different clones overexpressing 
SREBP-1 and Pro-SKI-1; clones 4, 6, 9. 

Fig. 14 shows the fatty acid synthase mRNA levels in HK 293 cells treated with (+) 
lipids (cholesterol and 25-hydroxycholesterol) or without lipids (-). 1-2 = wild type cells, 
3-4 = SREBP-1 overexpressors, 5-6 = a pool of 3 different clones overexpressing 
SREBP-1 and Pro-SKI-1; clones 4, 6, 9. 

Fig. 15 shows the HMG CoA reductase mRNA levels in HK 293 cells treated with lipids 
(box A) or without lipids (box B). 1 = wild type cells, 2 = vector only cells, 3 = SREBP-1 
overexpressor cells, 4 = SREBP-1 and ProSKI-1 overexpressor cells (high SREBP 
expression, clone 6), 5 = SREBP-1 and ProSKI-1 overexpressor cells (low SREBP 
expression, clone 9). 

Fig. 16 shows the HMG CoA reductase and farnesyl diphosphatase mRNA levels in 
HK 293 cells in different clones overexpressing SREBP-1 (1-5) or SREBP-1 and 
ProSKI-1 (clone 4, clone 6, clone 9). Cells were treated with fetal calf serum. 

Fig. 17 shows the nuclear SREBP-1 in HK 293 cells in absence of lipids. Mature 
SREBP is processed in the ER and translocated into the nucleus. 1 = wild type cells, 
2 = vector only cells, 3 = SREBP-1 overexpressors, 4 = SKI-1 antisense cells, 5 = 
ProSKI + SREBP-1 overexpressors clone 6, 6 = ProSKI + SREBP-1 overexpressors 
clone 9. 

Fig. 18 shows the processing of cytoplasmic SREBP-1 in HK 293 cells. 50 µg of 
protein per lane was separated in 6 % (above) and 10 % (below) SDS-PAGE gels. 
Uncut SREBP-1 (proSREBP-1) and intermediate SREBP-1 (intSREBP-1) cleaved by 
SKI-1 are indicated with arrows. Cell lines express ProSKI-1 (pSKI), SKI-1 anti-sense 
(SKI-1 as), SREBP-1, or ProSKI-1 and SREBP-1 (pSKI + SRE), or control vector 
(pcDNA3), as indicated. Analysis was performed in the presence (+ sterols) or absence 
of sterols (- sterols). 

Fig. 19 [A] is a schematic representation of the structure of FL-SKI-1 and its truncation 
mutant BTMD-SKI-1. The various SKI-1 domains depicted are, respectively, the signal 
peptide, pro-segment, catalytic domain, and the C-terminal region comprising a 
cytokine receptor/growth factor motif, a transmembrane domain and a cytosolic tail. 
The positions of polypeptides used to produce SKI-1-specific antisera (Ab: P, N and 
S) are also displayed. Fig. 19 [B] shows the biosynthetic analysis of SKI-1. VV:FL-SKI-1, 
BTMD-SKI-1 (bSKI-1) or control VV:WT infected LoVo cells were pulse-labeled with 
[35S]Cys for 3h. Media were immunoprecipitated with either Ab:S or Ab:P and then 
resolved by SDS-PAGE on an 8 % gel followed by autoradiography. Arrows point to 
the migration positions of the 100 kDa BTMD-SKI-1 (bSKI-1), the 98 kDa shed form 
(sSKI-1) as well as the 14 kDa prosegment product. Fig. 19 [C] shows a Western blot 
analysis of the overexpressed BTMD-SKI-1. Samples from VV:WT or BTMD-SKI-1 
infected BSC 40 cells (left and middle panel) were processed as described in 
"Experimental Procedures" and run on an 8 % SDS-PAGE reducing gel. Following 
electrotransfer to PVDF membranes, protein bands were visualized via ECL detection 
using primary rabbit antisera Ab:S or Ab:N. Purified BTMD-SKI-1 (right panel, *) was 
obtained from a Ni2+ affinity resin as described in "Experimental Procedures", then 
processed as described above. A mixture of Ab:S and Ab:P was used as primary 
antisera. Elution buffer was used as a control (CTL). 

Fig. 20 shows the biosynthetic analysis of the rate of zymogen processing and the fate 
of the prosegment of SKI-1. LoVo cells overexpressing VV:FL-SKI-1 were pulse-labeled 
with [3H]Leu for 15 min and then chased for 2h (P15C2h), or pulsed for 2h in 
the presence or absence of BFA (P2h). Cell lysates were immunoprecipitated with 
Ab:P, resolved by SDS-PAGE on a 14 % gel and autoradiographed. The migration 
positions of the major ~26, 24, 14, 10 and 8 kDa prosegments are emphasized. 

Fig. 21 illustrates the purification and identification of secreted recombinant pro-SKI-1. 
[A] Media obtained from HK293 cells stably expressing FL-SKI-1 were concentrated 
and sequentially applied to a C4 semi-preparative column (not shown) followed by a C4 
analytical RP-HPLC column, and then eluted by the indicated linear CH3CN gradient. 
[B] The fractions labeled I-IV were collected and analyzed by Western blotting using 
the primary antiserum Ab:P. [C,D] Proteins contained in fraction IV were separated on 
a 10 % SDS-PAGE reducing gel. Following electrotransfer, the proteins were stained 
with Ponceau Red. The immunoreactive 14 kDa and non-immunoreactive but colored 
~4.5 kDa [D] polypeptides were excised and submitted to N-terminal sequencing (X 
represents an undefined residue). [E] Mass spectrometry analysis by MALDI-TOF 
spectrometry of fraction IV. The C-terminal residues believed to correspond 
to the three ~14 kDa polypeptides are underlined, whereas the expected (potential) 
cleavage sites are indicated by dashed arrows. 

Fig. 22 shows the processing of proSKI-1 autocatalytic prosegment candidate 
sequences by purified, shed SKI-1. The proposed prosegment C-terminal mutant 17 
aa peptide IV [A] and 15 aa peptide IX [B] were digested for 18 h with metal chelation 
chromatography-purified BTMD-SKI-1. The cleavage products were separated by 
RP-HPLC using a 5 µm analytical Ultrasphere C18 column (Beckman) as described under 
"Experimental Procedures". The peptides contained in all but two peaks were identified 
by mass spectrometry. The unidentified peaks are attributable to contaminating 
activities seen in WT/empty vector controls. 

Fig. 23 shows the processing of proBDNF and SREBP-2 peptides by BTMD-SKI-1. 
The 14 aa peptide I [A] and 27 aa peptide II [B] were digested with BTMD-SKI-1 for 
150 and 60 min, respectively. The cleavage products were separated by RP-HPLC 
using a 5 µm analytical Ultrasphere C18 column (Beckman) as described under 
"Experimental Procedures". The peptides contained in the major peaks were identified 
by mass spectrometry and amino acid analysis (not shown). 

Fig. 24 shows the pH and Ca2+ activation profile of BTMD-SKI-1. BTMD-SKI-1 from 
VV-infected BSC40 cells was assayed as described under "Experimental Procedures" 
using a binary buffer system consisting of MES and HEPES, along with peptides I or 
II for the pH profile [A], and peptide II for the Ca2+ profile [B]. The results represent the 
average ± SD (indicated as error bars) of three separate determinations. 

Fig. 25 is an X-ray film autoradiograph showing the in situ hybridization pattern for SKI-1 
mRNA (A) and APP mRNA (B) at the anatomical plane in a sagittal section from a 4-day 
mouse. Note the similarity of distribution of SKI-1 and APP. A significant concentration of 
both SKI-1 and APP mRNA is revealed in the brain (Br), spinal cord (SpC), dorsal root 
ganglia (DRG), kidney (Ki), skin (Sk), submaxillary gland (SM) and bone tissue (B). 

Fig. 26 shows the comparative distribution of SKI-1 and APP in different regions of 
the lacrimal gland of an adult male mouse shown by immunocytochemistry. Peripherally 
located lobes display immunoreaction for both SKI-1 (A) and APP (B) in acinar cells. 
In the centrally located lobes the immunoreaction for SKI-1 (C) and APP (D) is 
confined to single cells distributed through the acini (medium arrows) and to 
intralobular ducts (long arrows). 

Fig. 27 illustrates the inhibition of proNGF processing. Rat Schwann cells were infected 
with either VV:POMC (antigen control), or co-infected with VV:NGF and either 
VV:POMC (control), VV:PDX, VV:ppFurin or VV:ppPC7. The cells were then pulse-labeled 
with [35S]Met for 4h and the media immunoprecipitated with an NGF antiserum. 
The migration positions of the 35 kDa proNGF and the 13.5 kDa NGF are shown. 

Fig. 28 illustrates the inhibition of proBDNF processing by furin. Western blot analysis 
of non-transfected (NT) COS-1 or cells transfected with pcDNA3 recombinants of 
proBDNF as control (BDNF) or together with recombinants expressing sense (S) or 
antisense (AS) ppPC7 or ppFurin. The secreted products resolved by SDS-PAGE were 
analyzed with a BDNF-specific antiserum [Santa Cruz]. 

Fig. 29 shows the biosynthetic analysis of the fate of the prosegment of SKI-1. 

(A) Zymogen processing of [3H]Leu SKI-1 in LoVo cells. LoVo cells overexpressing 
vaccinia virus full length SKI-1 were pulse-labeled for 15 min with [3H]Leu and then 
chased for 2h (P15C2h). Cell lysates were immunoprecipitated with antibody to the 
prosegment, resolved by SDS-PAGE on a 14% gel and the dried gel 
autoradiographed. The migration positions of the major 26, 24, 14, 10 and 8 kDa 
prosegments are emphasized. 

(B) Zymogen processing of [3H]Leu SKI-1 in BSC40 cells. BSC40 cells overexpressing 
vaccinia virus SKI-1 prosegment were pulse-labeled for 30 min with [3H]Leu and then 
chased for 2h (P30C2h). Cell lysates were immunoprecipitated with antibody to the 
prosegment, resolved by SDS-PAGE on a 14% gel and the dried gel 
autoradiographed. The migration positions of the 24 and 14 kDa prosegments are 
emphasized. 

Fig. 30 shows the inhibition of hα4 processing in stable transfectants of Jurkat T cells 
expressing the mPC5 prodomain mutated at Arg84 to Ala. The cell surface proteins of 
25 x 10^6 cells were biotinylated and immunoprecipitated with monoclonal hα4 antibody 
(HP 2/1). Following SDS gel electrophoresis under reducing conditions and blotting 
to nitrocellulose, the 80 kDa cleavage product was revealed by the chemiluminescence 
detection of anti-biotin streptavidin horseradish peroxidase. 

EXAMPLE 1: 

MATERIALS AND METHODS 

Polymerase Chain Reaction and Sequencing. Most reverse transcriptase 
polymerase chain reactions (RT-PCR) were performed using a Titan One Tube RT-PCR 
system (Boehringer Mannheim) on 1 µg of total RNA isolated from either a 
human neuronal cell line (IMR-32), mouse corticotrophic cells (AtT20), or rat adrenal 
glands using a TRIzol reagent kit (Life Technologies). The active site degenerate 
primers were: His (sense) 5'-GGICA(C,T)GGIACI(C,T)(A,T)(C,T)(G,T)(T,G)IGCIGG-3' 
and Ser (antisense) 5'-CCIG(C,T)IACI(T,A)(G,C)IGGI(G,C)(T,A)IGCIACI(G,C)(A,T)GTICC-3', 
based on the sequences GHGT(H,F)(V,C)AG and GTS(V,M)A(T,S)P(H,V)V(A,T)G, 
respectively. The amplified 525 bp products were sequenced on an ALF DNA 
sequencer (Pharmacia). To obtain the full length of rat and mouse SKI-1, we used 
PCR primers based on the human (12) and mouse sequences, in addition to 5' (13) 
and 3' (14) RACE amplifications. To avoid errors, at least three clones of the amplified 
cDNAs were fully sequenced. The GenBank accession numbers of the 3788 bp mouse 
mSKI-1 cDNA and 3895 bp rat rSKI-1 are AF094820 and AF094821, respectively. 
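
To make the degenerate primer notation concrete, the following Python sketch counts how many distinct oligonucleotides a primer written in this parenthesized style represents. It is an illustration under stated assumptions, not part of the described method: each parenthesized group such as (C,T) is read as one mixed position, inosine (I) is treated as a single non-degenerate base, the function name `degeneracy` is hypothetical, and the primer string is the His (sense) primer as read from the text above.

```python
import re
from math import prod

def degeneracy(primer):
    """Number of distinct sequences encoded by a primer in which mixed
    positions are written as comma-separated options in parentheses,
    e.g. 'CA(C,T)' stands for CAC and CAT. Inosine (I) counts as one base."""
    groups = re.findall(r"\(([^)]*)\)", primer)
    return prod(len(g.split(",")) for g in groups)

def primer_length(primer):
    """Length in nucleotides, counting each parenthesized group once."""
    return len(re.sub(r"\([^)]*\)", "N", primer))

# His (sense) primer as read from the text (an assumption where the scan was unclear)
his_sense = "GGICA(C,T)GGIACI(C,T)(A,T)(C,T)(G,T)(T,G)IGCIGG"
print(degeneracy(his_sense), "variants,", primer_length(his_sense), "nt")
```

Using inosine at fully ambiguous third-codon positions keeps the count of distinct oligonucleotides low, which is presumably why it appears so often in these primers.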

Transfection and Metabolic Labeling. Human SKI-1 (nt 1-4338) (12) in 
Bluescript (a generous gift from Dr. N. Nomura, Kazusa DNA Research Institute, 
Chiba, Japan; gene name KIAA0091; accession No. D42053) was digested with SacII 
(nt 122-4338) and inserted into the vector PMJ602. The construct was digested with 
5' KpnI/3' NheI, cloned into the KpnI/XbaI sites of pcDNA3 (Invitrogen), and the cDNA 
transfected into HK293 cells with a DOSPER liposomal transfection reagent 
(Boehringer Mannheim). A number of stable transfectants resistant to G418 and 
positive on western blots using a SKI-1 antiserum (see below) were isolated, and one 
of them (clone 9) was further investigated. Cells were pulsed for 4h with [35S]Met and 
the media and cell lysates immunoprecipitated with SKI-1 antisera directed against 
either amino acids (aa) 634-651, or aa 217-233, or a pro-SKI-1 antiserum directed 
against the pro-segment comprising aa 18-188 (Fig. 1). Immune complexes were 
resolved by SDS-PAGE on a 6% polyacrylamide/Tricine gel (15). 

Northern Blots, in situ Hybridizations and Immunocytochemistry. Northern
blot analyses (16) were done on total RNA from adult male rat tissues using either a
TRIzol reagent kit (Life Technologies) or a Quick Prep RNA-kit (Pharmacia) and on
polyA+ RNA of (male + female) rat adult tissues (Bio/Can Scientific). The blots were
hybridized overnight at 68°C in the presence of [32P]UTP SKI-1 cRNA probes,
consisting of the antisense of nucleotides 655-1249 of rat SKI-1 (accession No.
AF094821). For ISH, the same rat sense and antisense cRNA probes were doubly
labeled with uridine and cytosine 5'-(α-[35S]thio)triphosphate (16). The distribution of
SKI-1 mRNA in different tissues of adult and newborn rat (P1) was investigated after
emulsion autoradiography. Relative densities of specific SKI-1 mRNA labeling
per cell in selected organs were measured by counting the silver grains
produced by antisense SKI-1 riboprobes and subtracting the non-specific background
produced with sense SKI-1 riboprobes. Counts were made under 1000-fold
microscopic magnification in similar regions of adjacent sections stained with
hematoxylin and eosin. Results are the mean ± S.E.D. of 10-16 readings per cell type.
Newborn rats were frozen at -35°C in isopentane and then cut into 14-um sagittal
cryostat sections (1, 16). After hybridization, all tissue slides were exposed for 4 or 30
days to X-ray film or emulsion autoradiography, respectively. For immunofluorescence
staining we used a rabbit anti-SKI-1 antiserum at a 1:100 dilution and rhodamine-labeled
goat anti-rabbit IgGs diluted 1:20 (16). Red SKI-1 immunostaining was
compared with the green staining patterns of either fluorescein-labeled concanavalin A
(ConA; Molecular Probes, OR), an ER marker, or fluorescein-conjugated wheat germ
agglutinin (WGA; Molecular Probes, OR), a Golgi marker (17).
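
The background subtraction used for the silver grain counts can be sketched as follows. This is an illustration under stated assumptions only: the text does not define "S.E.D.", which is read here as the standard error of the difference between the antisense and sense means, and the function name and the example readings are hypothetical.

```python
from math import sqrt
from statistics import mean, variance


def specific_labeling(antisense_counts, sense_counts):
    """Return (specific grains per cell, standard error of the difference).

    antisense_counts: grains per cell counted with the antisense riboprobe.
    sense_counts: grains per cell counted with the sense (background) riboprobe.
    """
    difference = mean(antisense_counts) - mean(sense_counts)
    # Assumption: S.E.D. = sqrt(s_a^2 / n_a + s_s^2 / n_s), sample variances.
    sed = sqrt(variance(antisense_counts) / len(antisense_counts)
               + variance(sense_counts) / len(sense_counts))
    return difference, sed


# Hypothetical readings for one cell type (not data from this work).
grains, sed = specific_labeling([10, 12, 14], [1, 2, 3])
print(f"{grains:.1f} ± {sed:.2f} grains/cell")
```

If S.E.D. was instead meant as a plain standard deviation or standard error of the corrected counts, only the second expression would change.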
Ex vivo and in vitro proBDNF Processing. A vaccinia virus recombinant of
human SKI-1 (vv:SKI-1) was isolated as previously described for human proBDNF
(vv:BDNF) (15). The vaccinia virus recombinants of the serpins α1-antitrypsin
Pittsburgh (α1-PIT; vv:PIT) and α1-antitrypsin Portland (α1-PDX; vv:PDX) (18) were
generous gifts from Dr. G. Thomas (Vollum Institute, Portland, OR). For analysis of the
cleavage specificity of hSKI-1, 4 x 10^6 COS-7 cells were co-infected with 1 pfu/cell of
vv:BDNF and either the wild type virus (vv:WT) alone at 2 pfu/cell or with 1 pfu/cell of
each virus in the combinations: [vv:SKI-1+vv:WT], [vv:SKI-1+vv:PIT] and
[vv:SKI-1+vv:PDX]. At 10h post infection, cells were pulse labeled for 4h with 0.2 mCi
[35S]Cys-Met (Dupont). Media and cell extracts were immunoprecipitated with a BDNF
antiserum (19; kindly provided by Amgen) at a concentration of 0.5 ug/ml. The
precipitates were resolved on polyacrylamide gradient gels (13-22%) and the
autoradiograms obtained as described (15). Microsequencing analysis was performed
on the [35S]Met-labeled 32 kDa proBDNF and [3H]Leu-labeled 28 kDa BDNF, as
described (20). For in vitro analysis, the 32 kDa proBDNF obtained from the media of
LoVo cells infected with vv:BDNF was incubated overnight with the shed form of SKI-1
obtained from rat Schwann cells (16) co-infected with vv:SKI-1 and vv:PDX, either at
different pHs or at pH 6.0 in the presence of selected inhibitors: pepstatin (1 uM),
antipain (50 uM), cystatin (5 uM), E64 (5 uM), soya bean trypsin inhibitor (SBTI, 5 uM),
0.5 M phenylmethylsulfonyl fluoride (PMSF) + 50 uM para-aminophenylmethylsulfonyl
fluoride (pAPMSF), o-phenanthroline (5 mM) and EDTA (10 mM). The products were
resolved by SDS-PAGE on a 15% polyacrylamide gel, transferred to a PVDF
membrane and then probed with a BDNF antiserum (Santa Cruz) at a dilution of
1:1000.

RESULTS

Protein Sequence Analysis of SKI-1. We first aligned the protein sequences
within the catalytic domain of PC7 (21), yeast subtilases and bacterial subtilisins,
together with that of a novel subtilisin-like enzyme from Plasmodium falciparum (J-C.
Barale et al., submitted). This led to the following choice of conserved amino acids
around the active sites His and Ser: GHGT(H/F)(V/C)AG and
GTS(M/V)A(T/S)P(H/V)V(A/T)G, respectively. Thus, using degenerate oligonucleotides
coding for the sense His and antisense Ser consensus sequences, we initiated a series
of RT-PCR reactions on total RNA (see Materials and Methods) and isolated a 525 bp
cDNA fragment from the human neuronal cell line IMR-32. This sequence was found
to be 100% identical to that reported for a human cDNA called KIAA0091 (Accession
No. D42053) obtained from a myeloid KG-1 cell line (12) and 88% identical to that of
a 324 bp EST sequence (Accession No. H31838) from rat PC12 cells. We next
completed the rat and mouse cDNA sequences following RT-PCR amplifications of
total RNA isolated from rat adrenal glands and PC12 cells, and from mouse AtT20
cells. Starting from the equivalent rat and mouse 525 bp fragments, the complete
sequences were determined using a series of RT-PCR reactions with human-based
oligonucleotides in addition to 5' (13) and 3' (14) RACE protocols. As shown in Fig. 1,
alignment of the protein sequences deduced from the cDNAs of rat, mouse and human
SKI-1 revealed a high degree of conservation. Rat and mouse SKI-1 share 98%
sequence identity and a 96% identity to human SKI-1. Interestingly, within the catalytic
domain (Asp218 to Ser414) the sequence similarity between the three species is 100%.
Analysis of the predicted amino acid sequence suggests a 17 aa signal peptide,
followed by a putative pro-segment beginning at Lys18 and extending for some 160-180
amino acids. The proposed catalytic domain encompasses the typical active sites
Asp218, His249 and Ser414 and the oxyanion hole Asn338. This domain is followed by an
extended C-terminal sequence characterized by the presence of a conserved growth
factor/cytokine receptor family motif C849LDDSHRQKDCFW861. This sequence is then
followed by a potential 24 aa hydrophobic transmembrane segment and a less
conserved 31 aa cytosolic tail that remarkably consists of 35% basic residues. Some
of the clones isolated from rat adrenal glands suggested the existence of alternatively
spliced rSKI-1 mRNAs in which the segments coding for aa 430-483 or 858-901 are
absent. Finally, the phylogenetic tree derived from the alignment of the catalytic
domain of SKI-1 with subtilases (22) suggests that it is an ancestral protein that is
closer to plant and bacterial subtilases than to either yeast or mammalian homologues
(not shown).
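
Percent identity figures of the kind quoted above are computed over aligned sequences. The sketch below is illustrative only: the text does not state which alignment program or identity definition was used, so the convention here (identical columns divided by columns without a gap in either sequence) and the function name percent_identity are assumptions.

```python
def percent_identity(seq_a, seq_b, gap="-"):
    """Percent identity between two pre-aligned sequences of equal length.

    Columns containing a gap in either sequence are ignored (one of several
    common conventions).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    compared = [(a, b) for a, b in zip(seq_a, seq_b) if a != gap and b != gap]
    if not compared:
        return 0.0
    matches = sum(a == b for a, b in compared)
    return 100.0 * matches / len(compared)


# The two extreme variants of the His consensus GHGT(H/F)(V/C)AG differ at
# two of eight positions.
print(percent_identity("GHGTHVAG", "GHGTFCAG"))  # 75.0
```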

Tissue Distribution of SKI-1 mRNA. Northern blot analyses of SKI-1 mRNA
in adult male rat tissues reveal that rSKI-1 mRNA is widely expressed and is
particularly rich in anterior pituitary, thyroid and adrenal glands (Figs. 2A and 8). A
Northern blot of polyA+ RNA obtained from mixed adult male and female rat tissues
also showed a wide distribution and a particular enrichment in liver (Fig. 2B). Similarly,
analysis of 24 different cell lines (23) revealed a ubiquitous expression of SKI-1 mRNA
(not shown).

In situ hybridization data obtained in a day 2 postnatal rat also provided
evidence of a widespread, if not ubiquitous, distribution of rSKI-1 mRNA. Figure 3
shows at the anatomical level the presence of SKI-1 mRNA in developing skin (see
also Figure 9), striated muscles, cardiac muscles, bones and teeth as well as brain and
many internal organs. Strong hybridization signals were detectable in the retina,
cerebellum, pituitary, submaxillary, thyroid and adrenal glands, molars, thymus, kidney
and intestine. Evidence for the cellular expression of rSKI-1 mRNA was obtained from
analysis of the relative labeling densities per cell in selected tissues, based on a
semiquantitative analysis of emulsion autoradiographies (not shown). In the central
nervous system (CNS) rSKI-1 mRNA labeling was mostly confined to neurons,
whereas ependymal cells and supportive glial cells, such as presumed astrocytes,
oligodendrocytes, and microglia, exhibited 5-30 fold less labeling/cell (see Table 1 and
Figure 10). In addition, within the peripheral nervous system (PNS) trigeminal ganglia
reveal a 5-10 fold greater expression in neurons as compared to presumptive Schwann
cells (Figures 11 and 12 and Table 1). Labeling was observed in most of the glandular
cells in the anterior and intermediate lobes of the pituitary as well as in the pituicytes
of the pars nervosa. A semiquantitative comparison in the adult and newborn rat
pituitary gland, submaxillary gland, thymus and kidney demonstrated an overall 2-fold
decreased labeling of rSKI-1 mRNA with age (not shown).

Biosynthesis of hSKI-1. To define the molecular forms of human SKI-1 and
their biosynthesis, we generated both a vaccinia virus recombinant (vv:SKI-1) and a
stable transfectant in HK293 cells. Three antisera were produced against aa 18-188
(prosegment), 217-233 and 634-651 of SKI-1. Expression of vv:SKI-1 in 4 different cell
lines revealed that the enzyme is synthesized as a 148 kDa proSKI-1a zymogen which
is processed into 120, 106 and 98 kDa proteins. In this system, both the 148 and 120
kDa forms are recognized by the Pro-domain antiserum, whereas all 4 forms react with
the other two antisera. Processing of the 148 kDa proSKI-1a into the 120 and 106 kDa
forms occurs in the ER based on the presence of these proteins in cells pre-incubated
with the fungal metabolite brefeldin A (see 24 for refs., not shown). The same SKI-1-related
forms are also observed in stably transfected HK293 cells following a 4h pulse
labeling with [35S]Met (Fig. 4). The results reveal the intracellular formation of a
secretable 98 kDa form (SKI-1s) recognized by both of the SKI antisera but not by the
Pro antiserum. These data demonstrate that the 148 kDa proSKI-1a is N-terminally
cleaved into an intermediate 120 kDa form containing part of the prosegment
(proSKI-1b) which is then further excised to form a non-secretable 106 kDa SKI-1. This
suggests that two cleavages occur within the prosegment prior to the formation of the
presumably membrane-bound 106 kDa form which is later shed into the medium as
a 98 kDa soluble SKI-1s.

Intracellular localization of SKI-1. Double staining immunofluorescence was
used to compare the intracellular localization of the stably transfected human SKI-1 in
HK293 cells and that of either the ER or Golgi markers ConA and WGA (17),
respectively. The data show that SKI-1 exhibits: (i) peripheral nuclear staining,
colocalizing with ConA fluorescence, presumably corresponding to the ER (not shown);
(ii) paranuclear staining colocalizing with WGA fluorescence, suggesting the presence
of SKI-1 in the Golgi (Fig. 5A,B) and (iii) punctate staining observed in the cytoplasm
and within extensions of a few cells (Fig. 5A). Some, but not all, of the punctate
immunostaining matched that observed with WGA. This suggests that SKI-1 localizes
in the Golgi but may sort to other organelles, including lysosomal and/or endosomal
compartments. Since in HK293 cells we observed scant immunoreaction to either
cathepsin B or cathepsin D (not shown), we could not directly assess the presence of
SKI-1 within lysosomes. An indication of lysosomal/endosomal localization was
provided by the analysis of SKI-1 immunofluorescence within cells pre-incubated for
4h with 10 mM leucine-methyl ester (LME), a specific lysosomal/endosomal protease
inhibitor (25). The results showed a net increase in the proportion of cells exhibiting
punctate staining (Fig. 5C) as compared to control cells. Thus, SKI-1 immunoreactivity
is enhanced upon LME inhibition of lysosomal/endosomal hydrolases.

Enzymatic Activity and Cleavage Specificity of SKI-1. To prove that SKI-1
is a proteolytic enzyme we examined its ability to cleave five different potential
precursor substrates. Our choice was based on the tissue expression pattern of SKI-1
(Figs. 2, 3), which led us to select pro-opiomelanocortin (pituitary), pro-atrial natriuretic
factor (heart), HIV gp160 (T-lymphocytes) and, based on its neuronal expression,
pro-nerve growth factor and pro-brain-derived neurotrophic factor (proBDNF). Cellular
co-expression of vv:SKI-1 with the vaccinia virus recombinants of each of the above
precursors revealed that only proBDNF could be cleaved intracellularly by SKI-1. Thus,
upon expression of vv:BDNF alone in COS-7 cells we observed a partial processing
of proBDNF (32 kDa) into the known major 14 kDa BDNF product (15), and the minor
production of a previously observed (16; Mowla, S.J. et al., submitted) but still
undefined 28 kDa product (Fig. 6A). Upon co-expression of proBDNF and SKI-1, a net
increase in the level of the secreted 28 kDa BDNF is evident, without significant
alteration in the amount of 14 kDa BDNF (Fig. 6A). To examine whether the 28 kDa
product results from cleavage at a basic residue or at an alternative site, we first
co-expressed proBDNF, SKI-1 and either α1-PIT or α1-PDX, which are inhibitors of
thrombin and PC cleavages, respectively (18, 26). The results show that, unlike
α1-PIT, the serpin α1-PDX selectively blocks the production of the 14 kDa BDNF and
that neither α1-PIT nor α1-PDX affects the level of the 28 kDa product. This
demonstrates that α1-PDX effectively inhibits the endogenous furin-like enzyme(s)
responsible for the production of the 14 kDa BDNF (15), but does not inhibit the ability
of SKI-1 to generate the 28 kDa product. Thus, it is likely that the generation of the 28
kDa BDNF takes place via an alternate cleavage. Incubation of the cells with the Ca2+
ionophore A23187 abolished the production of both the 14 and 28 kDa products (not
shown), supporting the notion that, similar to the PCs (1-3, 24), SKI-1 is a
Ca2+-dependent enzyme.

In Fig. 6B, we present the N-terminal microsequence analysis of [35S]Met-labeled
32 kDa proBDNF and [3H]Leu-labeled 28 kDa BDNF. The sequence of the 32
kDa form revealed the presence of an [35S]Met at position 3 (Fig. 6B), which is in
agreement with the proposed sequence of human proBDNF (27) resulting from the
removal of an 18 aa signal peptide cleaved at the GCMLA18↓APMK site. The N-terminal
sequence of the 28 kDa product revealed a [3H]Leu at positions 2, 13 and 14 (Fig. 6B).
This result demonstrates that the 28 kDa BDNF is generated by a unique cleavage at Thr57
in the sequence: RGLT57↓SLADTFEHVIEELL (27).
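
The reasoning from radiolabel positions to a cleavage site can be sketched in a few lines. This is an illustration only: it scans just the sequence fragment quoted above (read as RGLT followed by SLADTFEHVIEELL) rather than full-length proBDNF, the number of sequencing cycles (14) is an assumption, and the function names are hypothetical.

```python
def label_positions(sequence, residue):
    """1-based positions at which `residue` occurs in `sequence`."""
    return [i + 1 for i, aa in enumerate(sequence) if aa == residue]


def candidate_starts(precursor, residue, observed, cycles):
    """0-based N-terminal start indices consistent with the labeled cycles.

    A start is kept when, within the first `cycles` residues, `residue`
    occurs at exactly the observed positions and nowhere else.
    """
    hits = []
    for start in range(len(precursor) - cycles + 1):
        window = precursor[start:start + cycles]
        if label_positions(window, residue) == sorted(observed):
            hits.append(start)
    return hits


# Fragment around the proposed site, as quoted in the text.
SEGMENT = "RGLTSLADTFEHVIEELL"
starts = candidate_starts(SEGMENT, "L", [2, 13, 14], cycles=14)
for s in starts:
    print(f"N-terminus at {SEGMENT[s:s + 4]}..., preceded by {SEGMENT[s - 1]}")
```

Within this fragment the only N-terminus that places Leu at cycles 2, 13 and 14 is the Ser following Thr, consistent with cleavage after Thr57; the same helper applied to APMK places Met at cycle 3, as observed for the 32 kDa form.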

To prove that SKI-1 is directly responsible for the production of the 28 kDa
BDNF at the novel Thr-directed cleavage, we performed in vitro studies. Thus,
proBDNF was incubated at various pHs with concentrated media of vv:SKI-1-infected
Schwann cells. A similar preparation obtained from wild type vaccinia virus-infected
cells served as control. The data show that SKI-1 exhibits a wide pH dependence
profile, revealing activity at both acidic and neutral pHs from pH 5.5 to 7.3 (Fig.
7A) but also at pH 4.5 and 8 (not shown). Analysis of the inhibitory profile of this
reaction revealed that metal chelators such as EDTA and o-phenanthroline, or a
mixture of the serine proteinase inhibitors PMSF + pAPMSF, effectively inhibit the
processing of proBDNF by SKI-1. The inhibition by EDTA is expected since, like all
PCs, SKI-1 is a Ca2+-dependent enzyme. The unexpected inhibition by 5 mM
o-phenanthroline may be due to excess reagent, since at 1 mM only 25% inhibition is
observed (not shown). All other class-specific proteinase inhibitors (aspartyl-,
cysteinyl-, and serine proteases of the trypsin type) proved to be inactive.

Table 1

Tissue                          Adult                       Newborn (P1)
                                Silver grains/cell ± SED    Silver grains/cell ± SED

C.N.S.
  Neurons, large                19.7 ± 5.8                  ND*
  Neurons, medium & small       5.7 ± 2.3
  Astrocytes, presumptive       0.6 ± 0.5
                                                            ND
  Neurons, pyramidal            15.3 ± 3.9
  Neurons, granules             23.7 ± 5.3
Corpus callosum                                             ND
  Oligodendrocytes, presumpt.   0.6 ± 0.6
Spinal cord                                                 ND
  Motorneurons                  27.8 ± 7.1
Circumventricular organs                                    ND
  Choroid plexus                9.6 ± 1.9
  Ependyma (III ventr.)         2.9 ± 0.8
P.N.S.                                                      ND
Trigeminal ganglion
  Neurons, large                14.6 ± 4
  Satellite cells               3.8 ± 22
  Schwann cells, presumpt.      1.3 ± 1.9
Pituitary gland
  Anterior lobe cells           4.9 ± 3.6                   9.3 ± 2.1
  Intermediate lobe cells       4.1 ± 0.9                   7.2 ± 1.4
  Posterior lobe pituicytes     3.6 ± 3.9                   6.7 ± 4.2
Thymus
  Cortical lymphocytes          4.1 ± 0.7                   7.1 ± 1.0
  Medullary reticular cells     2.7 ± 1.0                   4.4 ± 0.9
  Adipocytes                    0.3 ± 0.6                   ND
  Fibroblasts                   0.2 ± 0.1                   ND
Submaxillary gland
  Epithelial cells              2.1 ± 1.0                   3.9 ± 1.7
  Acinar cells                  2.4 ± 1.2                   4.5 ± 1.7
Kidney
  Glomerular cells              2.8 ± 0.9                   4.2 ± 0.9
  Convoluted tubules            4.1 ± 2.7                   9.8 ± 1.4

*ND = not determined

DISCUSSION

This work provides the first evidence for the existence of a mammalian
secretory Ca2+-dependent serine proteinase of the subtilisin-kexin type that selectively
cleaves at non-basic residues. Thus, SKI-1 processes the 32 kDa human proBDNF at
a KAGSRGLT↓SL sequence, generating a 28 kDa form which may have its own
biological activity (Mowla, S.J. et al., submitted). Such a cleavage site is close to the
consensus site deduced from a large body of work done with the PCs, whereby an
(R/K)-(X)n-R↓X-(L/I/V) motif [where n = 0, 2, 4 or 6] is favored by most PCs (1-3, 28).
Note that in the SKI-1 site, P1 Arg is replaced by Thr and an aliphatic Leu is present
at P2', an amino acid also favored by PCs (1-3, 28). Several proteins are known to be
cleaved following Thr. These include human anti-angiogenic platelet factor 4 (6;
QCLCVKTT↓SQ) and angiostatin (7; KGPWCFTT↓DP), the neuroendocrine
α-endorphin (4; KSQTPLVT↓LF), the ADAM-10 metalloprotease (8; LLRKKRTT↓SA), as
well as the amyloidogenic peptide Aβ43 (10; VGGVVIAT↓VI).
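
The comparison between the SKI-1 site and the PC consensus can be made explicit by labeling residues with the standard P/P' (Schechter and Berger) positions. The sketch below is illustrative only; the function names are hypothetical, and the consensus test encodes only the (R/K)-(X)n-R↓X-(L/I/V) motif as stated above (a basic residue at P2, P4, P6 or P8, Arg at P1, and an aliphatic residue at P2').

```python
def label_site(n_side, c_side):
    """Map P/P' labels to residues around a scissile bond.

    n_side: residues N-terminal to the cleaved bond (last residue is P1).
    c_side: residues C-terminal to the cleaved bond (first residue is P1').
    """
    labels = {}
    for i, aa in enumerate(reversed(n_side), start=1):
        labels[f"P{i}"] = aa
    for i, aa in enumerate(c_side, start=1):
        labels[f"P{i}'"] = aa
    return labels


def fits_pc_consensus(labels):
    """True if the site matches (R/K)-(X)n-R|X-(L/I/V) with n = 0, 2, 4 or 6."""
    basic = {"R", "K"}
    return (labels.get("P1") == "R"
            and any(labels.get(p) in basic for p in ("P2", "P4", "P6", "P8"))
            and labels.get("P2'") in {"L", "I", "V"})


# Human proBDNF site cleaved by SKI-1, as quoted in the text.
ski1_site = label_site("KAGSRGLT", "SL")
print(ski1_site["P1"], ski1_site["P4"], ski1_site["P8"], ski1_site["P2'"])
print(fits_pc_consensus(ski1_site))
```

For this site the labeling gives Thr at P1, Arg at P4, Lys at P8 and Leu at P2', so the site fails the PC consensus only at P1; substituting Arg at P1 would satisfy it.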

Interestingly, comparison of the phylogenetically highly conserved sequence of
proBDNF revealed an insertion of hydroxylated amino acids (Thr and Ser) just after the
identified SKI-1 cleavage site of human proBDNF. Thus, in rat and mouse proBDNF,
two threonines are inserted (RGLTTT-SL) and in porcine proBDNF five serines are added
(RGLTSSSSS-SL) (27). These observations raised a number of questions: (i) do
these insertions affect the kinetics of proBDNF cleavage by SKI-1? (ii) does SKI-1
recognize both single and pairs of Thr and Ser and combinations thereof? (iii) is the
presence of a basic residue at P4, P6 or P8 important for cleavage? and (iv) similar to
enzymes cleaving at basic residues (29), does the possible phosphorylation at specific
Thr or Ser residues affect substrate cleavability by SKI-1? Answers to these questions
are provided hereinbelow.

Biosynthetic analysis of the zymogen processing of proSKI-1 demonstrated a
two-step ER-associated removal of the pro-segment (Fig. 4). Furthermore, analysis of
the [35SO4]-labeled SKI-1 demonstrated only the presence of sulfated 106 and 98 kDa
forms but not that of either the 148 or 120 kDa forms recognized by the Pro-segment
antiserum (not shown). Since sulfation occurs in the trans Golgi network, this confirms
that the removal of the pro-segment occurs in the ER. Like furin and PC5-B (1-3, 24),
the membrane-bound 106 kDa SKI-1 is transformed into a soluble 98 kDa form that is
released into the medium by an as yet unknown mechanism. The secreted 98 kDa
SKI-1s is enzymatically active since it processes proBDNF in vitro (Fig. 7). Numerous
attempts to sequence the SDS-PAGE purified [3H]Leu and Val-labeled 148 kDa and
98 kDa forms gave ambiguous results, suggesting that SKI-1 is refractory to
N-terminal Edman degradation. Presently, we cannot define the two zymogen cleavage
sites leading to the sequential formation of the 120 kDa proSKI-1b and 106 kDa SKI-1
deduced by pulse (Fig. 4) and pulse-chase studies (not shown). Examination of the
pro-segment sequence (Fig. 1), the species-specific proBDNF motif potentially
recognized by SKI-1 (see above), and the alignment of SKI-1 with other subtilases
(22) suggests two possible conserved sites: RNNPSS95↓DYPS and RHSS182↓RRLL.
Both sites predict a cleavage after pairs of Ser with either a P6 or a P4 Arg,
respectively.

Phylogenetic structural analysis of the predicted amino acid sequence of SKI-1
reveals that this serine proteinase is closer to plant and bacterial subtilases than it is
to yeast and mammalian PCs. The 100% conservation of the catalytic domain
sequence, although striking and suggestive of an important function, is not far from the
98% similarity between human and rat PC7 (3, 21). The sequence C-terminal to the
catalytic domain of SKI-1 is very different from that of any of the known PCs. In fact,
although PCs have a typical P-domain critical for the folding of these enzymes (for
reviews see 1-3), we did not find the hallmark sequences (3, 30) of the P-domain within
the SKI-1 structure. Instead, unlike the PCs, SKI-1 contains a conserved growth
factor/cytokine receptor motif whose functional importance will need to be addressed,
especially since this motif is partly missing in alternatively spliced forms (Fig. 1).
Finally, the highly basic nature of the cytosolic tail of SKI-1 (Fig. 1) may be critical for
its probable cellular localization within endosomal/lysosomal compartments (Fig. 5),
similar to the importance of basic residues for the accumulation of the α-amidation
enzyme PAM in endosomal compartments (Milgram, S.L., personal communication).

The wide tissue distribution of SKI-1 mRNA transcripts suggests that this
enzyme processes numerous precursors in various tissues. Furthermore, the observed
developmental down-regulation of the level of its transcripts also suggests a functional
importance during embryonic development. The fact that SKI-1 can cleave C-terminal
to Thr and possibly Ser residues suggests that, like the combination of PCs and
carboxypeptidases E and D (31), a specific carboxypeptidase may also be required to
trim out the newly exposed C-terminal hydroxylated residues. Such a hypothesis may
find credence in a report suggesting that the amyloidogenic Aβ43 (ending at Thr) may
be transformed in vitro into Aβ42 and Aβ40 by a brain-specific carboxypeptidase(s)
(32).

A recent report demonstrated the existence of a soluble subtilisin-like enzyme
exhibiting a 29% sequence identity to SKI-1 in Plasmodium falciparum merozoites
(PfSUB-1). This enzyme localizes to granular-like compartments and presumably
cleaves at a Leu↓Asn bond (33). In that context, SKI-1 may represent the first member
of an as yet undiscovered mammalian family of proteinases implicated in the limited
proteolysis of proproteins at sites other than basic amino acids that may differ by their
intracellular localization and cleavage specificity.

EXAMPLE 2

Genetic and biochemical evidence indicates that SKI-1/S1P is the protease that
cleaves sterol-regulatory element-binding proteins (SREBPs), which function to control
lipid biosynthesis and uptake in animal cells {Sakai, J. et al. (1998) Molecular Cell 2,
505-514; Cheng, D. et al. (1999) J. Biol. Chem. 274, 22805-22812; Toure, A. et al.
(1999) In: Peptides for the New Millennium: Proceedings of the 16th American Peptide
Symposium}. SKI-1 and SREBPs play critical roles in the feedback pathways by which
cholesterol suppresses transcription of genes encoding HMG CoA reductase and other
enzymes of cholesterol biosynthesis as well as the low density lipoprotein (LDL)
receptor. A SKI-1 inhibitor would be of use under clinical conditions in which there is
insufficient down-regulation of SREBP-dependent transcription by sterols. For
example, in the Niemann-Pick group of diseases a high sphingomyelin content of cells
leads to an increase in proteolysis of SREBP-2 and a subsequent increase in
cholesterol biosynthesis {Scheek, S. et al. (1997) Proc. Natl. Acad. Sci. USA 94,
11179-11183; Spence, M.W., and Callahan, J.W. (1989) Sphingomyelin-cholesterol
lipidoses: The Niemann-Pick Group of Diseases. In The Metabolic Basis of Inherited
Disease (Scriver, C.R., Beaudet, A.L., Sly, W.S., and Valle, D., editors), McGraw-Hill
Publ. Co., 6th edition, chapter 66, 1655-1675; Sviridov, D. (1999) Histology &
Histopathology 14 (1): 305-319}. Perhaps of greater significance, nuclear SREBP-1c
protein levels were significantly elevated in mouse models for non-insulin dependent
diabetes, ob/ob and aP2 SREBP-1c mice, which was associated with elevated mRNA
levels for known SREBP target genes involved in the biosynthesis of fatty acids
(Shimomura, I. et al. J. Biol. Chem. 1999; 274:30028-30032).

In addition, inhibiting the SREBP-dependent transcription of farnesyl
diphosphate synthase would reduce farnesyl pyrophosphate biosynthesis and, like
HMG-CoA reductase and farnesyl-protein transferase inhibitors, could potentially be
useful to treat a number of diseases such as Ras-dependent cancers and restenosis
(Reference - United States Patent 5,925,651). With regard to a potential treatment
for restenosis, HMG-CoA reductase inhibitors decrease smooth muscle cell (SMC)
migration and proliferation, and induce SMC apoptosis {Bellosta, S. et al. (1998)
Atherosclerosis 137, S101-S109; Guijarro, C. et al. (1998) Circulation Research 83,
490-500}.

As mentioned above, inhibition of PC activity seems to offer new therapeutic
targets. Unfortunately, previous attempts using inhibitory peptides have failed, due either
to cytotoxicity of the agents used or to poor targeting (17, 18). We have focused on the
inhibitory properties of PC prosegments in order to find a safe and effective way of
silencing the enzyme.

To study the effect of the SKI-1 prosegment (ProSKI-1) on SREBP
processing and SREBP-mediated transcriptional activity, we isolated a cDNA fragment covering
the 188 amino acids that make up the signal peptide and the prosegment of SKI-1,
including the predicted cleavage site RRLL176. This autocatalytic cleavage site was
confirmed by mass spectral analysis and amino acid sequencing by other investigators
(19). We isolated stable cell lines overexpressing SREBP-1 (neo resistance) and the
ProSKI-1 plasmid (zeo resistance). A background of SREBP-1 overexpression was used in order
to improve detection of the nuclear NH2-terminal segment of SREBP in immunoblot
experiments.

The effect of ProSKI-1 on target gene mRNA: mRNA expression in HK293 cells was
studied by Northern blotting as described in the methods section. In wild type (wt),
vector only, and SREBP overexpressor cells in the presence of lipids, the mRNA levels
were low for all studied genes: LDL-receptor, HMG-CoA reductase, farnesyl
diphosphate (FDP) (Fig. 13), and fatty acid synthase (FAS) (Fig. 14). However, when
these cells were treated with media containing no cholesterol, a clear increase was
observed in mRNA expression for all these genes, as demonstrated in earlier studies.
Interestingly, the corresponding mRNA levels were greatly reduced in both conditions in
cells overexpressing ProSKI-1 and SREBP-1, suggesting that SREBP-mediated
transcription can be blocked efficiently by the prodomain-mediated inhibition of the
SKI-1 protease (Figs. 13 and 14). The effect was observed in early passages of previously
frozen cell lines. However, when the same clones were kept in culture for further
passages, in contrast to the earlier findings, the target gene mRNA levels were now normal
or even higher than in control cells (Figure 15). This finding suggests that cells can
adapt to new conditions and maintain their lipid homeostasis even without SREBP-mediated
regulation and synthesis. This finding was supported in another experiment
with several cell lines overexpressing SREBP-1 or SREBP-1 and ProSKI-1 (Fig. 16).
While HMG CoA reductase and farnesyl diphosphatase varied markedly between
different cell lines containing only SREBP-1 (Fig. 16, lanes 1-5), mRNA levels
measured from cells overexpressing ProSKI-1 and SREBP-1 (Fig. 16, lanes cl4, cl6,
and cl9) showed no variation and were higher than in SREBP-1 cells.

The effect of ProSKI-1 on nuclear SREBPs: Western blot experiments were
performed to illustrate the effect of ProSKI-1 on SREBP-1 processing in these cells.
After staining with an antibody against the NH2-terminal end of SREBP-1, a band
around 60 kDa appeared on blots of nuclear extracts (Fig. 17), as demonstrated earlier
by other investigators (2, 3). As expected, only a weak signal was detected in the presence
of sterols. In the absence of sterols a significant increase was observed, especially in
SREBP-1 cells. Only minute amounts of nuclear SREBPs were detected when ProSKI-1
was present, suggesting that sterol-mediated proteolysis of SREBPs is efficiently
blocked in these cells in the presence of ProSKI-1 (Figure 17 shows the data from clones
6 {lane 5} and 9 {lane 6}).

The inhibitory effect of ProSKI-1 was further demonstrated by studying the
processing of cytoplasmic full length SREBP-1 (proSREBP-1) (Fig. 18). The
processing of proSREBP-1 by SKI-1/S1P into intermediate (intSREBP-1) forms,
shown previously by other investigators (19), was clearly demonstrated in clones
overexpressing SREBP-1. Significantly, in cell lines overexpressing SREBP-1 together
with the inhibitory prodomain of SKI-1 (pSKI + SRE), accumulation of proSREBP-1
was observed and formation of the intermediate form(s) of SREBP-1 was abolished.
These results, along with the observed reduction in nuclear SREBP (Fig. 17), indicate
that ProSKI-1 efficiently inhibits SKI-1 protease activity and blocks SREBP processing
in mammalian cells. In addition, the specificity of ProSKI-1 inhibition was studied by
using a substrate not processed by SKI-1 (neurotrophin-3; NT-3). Both the level and
furin-derived processing of NT-3 were unaffected by the presence of ProSKI-1 (not
shown). These results suggest that ProSKI-1 is SREBP- and pro-BDNF-specific and
that it does not affect other secretory proteins which are not substrates for SKI-1.

In these experiments a pro-domain was successfully used for the first time as
a subtilase inhibitor in vivo. ProSKI-1 seems to be a promising therapeutic tool for
SREBP-mediated pathologies, which may or may not be directly related to cholesterol
or fatty acid homeostasis. For instance, SREBP-dependent isoprenoids, such as
farnesol and geranylgeraniol, have been shown to be associated with, e.g., endothelial nitric
oxide synthetase (eNOS) (20-23), vascular smooth muscle proliferation and migration as
well as ras-protein mediated cell proliferation (24-28). Furthermore, links to the PPAR-γ
mediated signaling system, including adipocyte differentiation and insulin resistance,
have already been reported (29-33). This novel prosegment approach to inhibiting enzyme
activity will certainly also inspire other investigators in different fields, since it may be
possible to specifically inhibit other enzymes with this prosegment technology, leading
to new treatments for a variety of diseases. On the other hand, these results provide
new data supporting the existence of an SREBP-independent, but lipid-dependent (Fig.
3) control of lipid homeostasis in human cells, although the alternative sensor of
lipids under these conditions is currently unknown.

Materials and Methods 
Materials: 

Cell Culture: HK293 cells were maintained as monolayers in Dulbecco's modified
Eagle's medium containing 100 units/ml penicillin and 100 µg/ml streptomycin
sulfate (medium A) supplemented with 10 % fetal calf serum. Twenty-four hours before
RNA and protein extractions, medium A was supplemented with 5 % lipoprotein-deficient
serum, 50 µM mevalonate (Sigma), 50 µM compactin (Sigma) and either no sterols or
1 µg/ml of 25-hydroxycholesterol and 10 µg/ml of cholesterol. Four hours before protein
extraction, 25 µg/ml N-acetyl-leucinyl-leucinyl-norleucinal (ALLN) was added. Total RNA
was isolated with Trizol reagent (Gibco BRL) according to the instructions of the
manufacturer. In order to extract proteins, cells were washed and collected in PBS with
protease inhibitors (). After addition of buffer A (1 % Triton X-100, 50 mM Tris-maleate,
2 mM CaCl2, inhibitor cocktail (), and ALLN), cells were mixed with a pipette and allowed
to swell on ice for 20 minutes. The solution was then centrifuged for 5 minutes at
15 000 rpm, and the supernatants, representing membrane proteins, were collected and
stored at -70 °C until analyzed. The remaining pellets were resuspended in buffer B
(20 mM Tris pH 7.9, 400 mM NaCl, 1 mM EDTA, 1 mM EGTA, and protease inhibitors).
Samples were shaken at 4 °C for 1 hour and centrifuged, and the supernatant was frozen
in aliquots at -70 °C.

Plasmid constructions: The SKI-1 prosegment containing aa 1-188 was isolated by PCR
using the following oligonucleotides: [5' GGA TCC GAA GAA ACA TCT GGG CGA CAGA
3'] and [5' CTC GAG GGC TCT CAG CCG TGT GCT 3'], and cloned into the PCR 2.1 TA
cloning vector for sequencing. It was then subcloned into the pcDNA3zeocin vector
(Invitrogen) (BamHI / HindIII sites) for transfections.

SREBP-1 in Bluescript II SK (ATCC 79810) was subcloned into the SalI / BamHI sites
of the pcDNA3geneticin vector.

Transfections: HK293 cells were plated at a density of 5 x 10^5 / 60 mm dish in medium
A with 10 % fetal calf serum and were cultured until they were 40-60 % confluent. The
cells were then transfected with 10 µg plasmid DNA (pcDNA3neo, pcDNA3neo-SREBP-1,
pcDNA3neo-SREBP-1 and pcDNA3zeo-ProSKI-1) using Lipofectin reagent (Life
Technologies, city, state) according to the manufacturer's instructions. On day two,
medium containing the appropriate selection agents (800 µg/ml Geneticin for pcDNA3neo,
x00 µg/ml Zeocin for pcDNA3zeo) was added. The medium was changed every two days until
defined colonies were evident. Colonies were isolated, and the resulting stable cell lines
were analyzed by immunoblotting with ProSKI-1 and SREBP-1 antibodies.
Northern blotting: 20 µg of total RNA was electrophoretically separated in a 1.0 %
agarose gel and transferred to Hybond N+ filters (Amersham, city, state) by capillary
blotting. After transfer, filters were crosslinked by UV irradiation in a Stratalinker
(Stratagene). Filters were prehybridized at 42 °C for 1 hour and hybridized with
random-labeled 32P cDNA probes for 16-20 hours. Ultrahyb buffer (Ambion) was used.
After hybridization, filters were washed and exposed to film for the indicated time, and
bands were quantified by densitometry. The following primer pairs were used to clone
cDNA probes: HMG CoA reductase [5' GAG GAA GAG ACA GGG ATA AAC 3'], [5'
GGG ATA TGC TTA GCA TTG AC 3'], farnesyl diphosphate [5' AGC CCT ATT ACC
TGA ACC TG 3'], [5' GAA TCT GAA AGA ACT CCC CC 3'], fatty acid synthase [5'
TTC CGA GAT TCC ATC CTA CG 3'], [5' TGC AGC TCA GCA GGT CTA TG 3'],
acetyl CoA carboxylase [5' TCT CCT CCA ACC TCA ACC AC 3'], [5' CCA GCC TGT
CAT CCT CAA TAT C 3'], SREBP-1 [5' GGA GCC ATG GAT TGC ACT TTC 3'], [5'
AGG AGC TCA ATG TGG CAG GA 3'], LDL-receptor [5' 3'], [5' 3']. Amplification
products were cloned into pGEM (Promega) and sequenced. 18S cDNA was
purchased from Ambion.
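
As a quick consistency check on the probe primers listed above, the following sketch computes the length, GC fraction and a rule-of-thumb melting temperature for each oligonucleotide. The Wallace rule used here (2 °C per A/T, 4 °C per G/C) is an assumption of this illustration and only a rough guide for short oligonucleotides; the text does not state how the primers were designed, and the "first"/"second" labels merely reflect the order in which each pair is printed.

```python
# Sequences copied from the primer list in the text, with spaces removed.
# The LDL-receptor pair is left out because its sequences are blank in the text.
PRIMERS = {
    "HMG CoA reductase, first": "GAGGAAGAGACAGGGATAAAC",
    "HMG CoA reductase, second": "GGGATATGCTTAGCATTGAC",
    "farnesyl diphosphate, first": "AGCCCTATTACCTGAACCTG",
    "farnesyl diphosphate, second": "GAATCTGAAAGAACTCCCCC",
    "fatty acid synthase, first": "TTCCGAGATTCCATCCTACG",
    "fatty acid synthase, second": "TGCAGCTCAGCAGGTCTATG",
    "acetyl CoA carboxylase, first": "TCTCCTCCAACCTCAACCAC",
    "acetyl CoA carboxylase, second": "CCAGCCTGTCATCCTCAATATC",
    "SREBP-1, first": "GGAGCCATGGATTGCACTTTC",
    "SREBP-1, second": "AGGAGCTCAATGTGGCAGGA",
}


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(seq: str) -> int:
    """Wallace rule estimate in degrees C: 2*(A+T) + 4*(G+C)."""
    seq = seq.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


def summarize(primers: dict) -> dict:
    """Map each primer name to (length, GC fraction, Wallace Tm)."""
    return {name: (len(s), gc_fraction(s), wallace_tm(s)) for name, s in primers.items()}


if __name__ == "__main__":
    for name, (length, gc, tm) in summarize(PRIMERS).items():
        print(f"{name:32s} {length:2d} nt  GC={gc:.2f}  Tm~{tm} C")
```

Primers within a pair that differ widely in estimated melting temperature would be worth re-checking against the original sequence listing.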

Immunoblot analysis: 50 µg of nuclear extract and membrane fractions were
separated in an SDS-PAGE gel. After electrophoresis, proteins were transferred to a
nitrocellulose membrane. Membranes were probed with the appropriate primary SREBP-1
(Santa Cruz) or ProSKI-1 antibodies and with secondary antibodies. After washing,
chemiluminescent substrate (Santa Cruz) was added, and membranes were exposed to
x-ray film for 1-30 min. Gels were calibrated with prestained molecular weight markers
(New England Biolabs).


EXAMPLE 3 

The soluble SKI-1 isoform, collected from cell media, was used to study the in
vitro cleavage properties of this enzyme on a number of synthetic substrates. In
addition, we present data on the in vitro inhibitory character of three prosegment
constructs of SKI-1, which we obtained as bacterial recombinant proteins. Moreover,
we examined the processing of hSKI-1 in LoVo cells infected with a vaccinia virus (VV)
recombinant as well as in a stable transfectant of HK293 cells (10).

EXPERIMENTAL PROCEDURES 
Vaccinia Virus Recombinant of BTMD-SKI-1 - The preparation of a soluble form
of hSKI-1 involved the initial amplification by polymerase chain reaction (PCR) of a
1250 base pair (bp) product encompassing nucleotides (nts) 491-1740 of the hSKI-1
cDNA (12), which includes the initiator methionine. The sense (s) and antisense (as)
oligonucleotides were 5' GTGACCATGrAAGCTTGTCAACATCTGG 3' and 5'
ACACTGGTCCCTGAGAGGGCCCGGCA 3', respectively. This completely sequenced
fragment, which had been inserted into the PCR2.1 TA cloning vector (Invitrogen), was
first digested with NotI and AccI. It was then ligated with the similarly digested
full-length hSKI-1 cDNA 3.5 kb product, resulting in a product called 5' hSKI-1-FL. In
order to obtain a soluble form of hSKI-1 with a hexa-His sequence just before the stop
codon, PCR amplification was carried out using the sense and antisense
oligonucleotides 5' ATTGACCTGGACAAGGTGGTG 3' and 5'
GGATCCTCTAGATCAGTGGTGGTGGTGG-
TGGTGGTGCTCCTGGTTGTAGCGGCCAGG 3'. This resulted in a 165 bp fragment
encoding the C-terminal sequence PGRYNQE997-(H6)* (10). Following digestion with
5' EcoNI and 3' XbaI, the product was ligated to the aforementioned and similarly
digested 5' hSKI-1-FL. This cDNA, coding for BTMD-SKI-1 ending with a hexa-His
sequence, was then transferred to the BamHI/XbaI site of the vaccinia virus (VV)
transfer vector pMJ601. A recombinant was then isolated as previously reported (13).
The VV recombinant of full-length hSKI-1 has been described (10).

Biosynthetic Analyses - Seventeen hours following infection with 2 pfu each of
VV:SKI-1 and VV:BTMD-SKI-1 recombinants, human LoVo cells (3 x 10^6) were
radiolabeled with 500 µCi of [3H]Leu for 2 h or pulsed for 15 min followed by a chase
of 2 h, in the presence or absence of 5 µg/ml of the fungal metabolite brefeldin A (BFA)
as described (10,14). Media and cell lysates were immunoprecipitated with SKI-1
antiserum directed against either aa 634-651 or the prosegment comprising aa 18-188
(10). Immune complexes were resolved by SDS-PAGE on an 8 % or 14 %
polyacrylamide/Tricine gel (10) and the dried gels were autoradiographed (10,14). All
biosynthesis experiments were performed at least twice.

Isolation and Purification of Recombinant hSKI-1 Prosegments - Three N-terminal
fragments of hSKI-1 were isolated by PCR using a common (s)
oligonucleotide [5' GGATCCGAAGAAACATCTGGGCGACAGA 3'] and one of three
(as) oligonucleotides: [5' CTCGAGGGAGAGGCTGGCTCTTCG 3'], [5'
CTCGAGGGCTCTCAGCCGTGTGCT 3'] or [5'
CTCGAGTGTCTGGGCAACCTGGCGCGGG 3']. These prosegment fragments,
ending at aa 169, 188, and 196 (10), were cloned in the PCR 2.1 TA cloning vector for
sequencing. They were then transferred into the BamHI / XhoI sites of the bacterial
expression vector pET 24b (Novagen). These recombinants were transformed into the
E. coli strain BL21. Protein expression was induced with 1 mM isopropyl
β-D-thiogalactoside and the cultures were grown for 3 h at 37 °C. The cell pellets were
sonicated on ice in a binding buffer containing 6 M guanidine-HCl (Novagen) until a
clear solution was obtained. The clarified and filtered solution was then applied to a
nickel affinity column (Novagen) and eluted with 500 mM imidazole. The eluates were
dialyzed overnight at 4 °C against 50 mM sodium acetate (pH 7). The protein
precipitate was solubilized with glacial acetic acid, filtered through a 0.45 µm disk and
further purified on a 5 µm C4 column (0.94 x 25 cm; Chromatographic Sciences
Company Inc; CSC) by reverse-phase high performance liquid chromatography
(RP-HPLC). The purity was assessed by Coomassie staining and the identity of the
products verified by mass spectrometry on a Matrix Assisted Laser Desorption Time
of Flight (MALDI-TOF) Voyager DE-Pro instrument (PE PerSeptive Biosystems). The
amounts of prosegments were determined by quantitative amino acid analysis (13).

Expression and Purification of Recombinant BTMD-SKI-1 - Following infection
of BSC40 cells (75 x 10^6 cells) with 2 pfu/cell of recombinant VV:BTMD-SKI-1, the cells
were washed and incubated at 37 °C for 18 h in a serum-free minimal essential
medium (MEM; Life Technologies). Media (45 ml) were then dialyzed, concentrated
20-fold to 2.2 ml on Centriprep-30s (Amicon) and stored at -20 °C in 40 % glycerol.
For purification 2 , the concentrated media were applied to a Ni2+ affinity resin (Novagen)
or a Co2+ affinity resin (Clontech Laboratories) as described by the manufacturer. After
two washes with 5 mM imidazole, the protein was eluted with 200 mM imidazole and
tested for enzymatic activity and immunoreactivity by Western blot (see below).

Western Blot Analyses - Aliquots of partially purified BTMD-SKI-1 were
separated by 8 or 12 % SDS-PAGE followed by electro-transfer of the proteins onto
polyvinylidene fluoride (PVDF) membranes (Schleicher and Schuell). These
membranes were probed with an antiserum directed against either SKI-1 [aa 217-233
(Ab:N) or aa 634-651 (Ab:S)] or pro-SKI-1 [aa 18-188 (Ab:P)]. Protein bands were
visualized by enhanced chemiluminescence (ECL) (Boehringer Mannheim).

Purification, N-terminal Sequencing and Mass Spectrometric Analysis of the
Secreted Recombinant Prosegment(s) of hSKI-1 - Concentrated media obtained from
either VV:BTMD-SKI-1-infected BSC40 cells or from a stable transfectant of full-length
hSKI-1 in HK293 cells (10) were loaded onto an RP-HPLC 5 µm C4 column (0.94 x 25
cm) (Vydac). Proteins were eluted at 2 ml/min using a 1 %/min linear gradient (15-70
%) of 0.1 % aqueous trifluoroacetic acid (TFA)/CH3CN with monitoring at 210 nm. The
products were analyzed by Western blotting, after which the immunoreactive fractions
were further purified on a CSC 5 µm C4 column (0.2 x 25 cm). Mass values were
obtained by MALDI-TOF spectrometry using the matrix
3,5-dimethoxy-4-hydroxycinnamic acid (Aldrich Chemical Co). For N-terminal
sequencing, fraction IV proteins (Fig. 21A) were separated by SDS-PAGE, transferred to
Immobilon-P membranes, and stained with Ponceau Red. The 14 and 5 kDa bands were
excised and sequenced using an Applied Biosystems Model 477 sequenator operating in
the gas-phase mode (15).
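
The run time and eluent volume implied by a linear gradient such as the one described above (15-70 % at 1 %/min and 2 ml/min) follow from simple arithmetic. The sketch below is only an illustration; the function name is hypothetical, and it ignores any equilibration, hold or wash steps, which the text does not describe.

```python
def gradient_run(start_pct, end_pct, slope_pct_per_min, flow_ml_per_min):
    """Return (duration in min, eluent volume in ml) for a linear gradient."""
    if slope_pct_per_min <= 0 or flow_ml_per_min <= 0:
        raise ValueError("slope and flow rate must be positive")
    duration_min = (end_pct - start_pct) / slope_pct_per_min
    return duration_min, duration_min * flow_ml_per_min


if __name__ == "__main__":
    # Gradient quoted in the text for the prosegment purification on the C4 column.
    minutes, volume = gradient_run(15, 70, 1, 2)
    print(f"{minutes:.0f} min, {volume:.0f} ml of eluent")
```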

1 Although we managed to produce limited quantities of partially purified SKI-1 using
metal chelating resins, there was insufficient enzyme to carry out full kinetic analyses.
However, since the medium of WT virus- (or control vector-) expressing cells produced no
significant peptide hydrolysis (with the exception of peptides VIII and IX), we mainly used
the concentrated media of BSC40 cells infected with VV:BTMD-SKI-1. Thus, the metal
chelation-purified enzyme served mainly to verify that the enzyme from concentrated media
behaved similarly to this form. We therefore confirmed all of the peptide cleavage sites, the
SREBP-2 pH optimum, and the Ca2+ requirement presented below.

Synthesis of Peptide Substrates - All Fmoc amino acid derivatives (L-form), the
coupling reagents, and the solvents for peptide synthesis were purchased from PE
Biosystems Inc. (Framingham, Mass, USA), Calbiochem (San Diego, Ca, USA), or
Richelieu Biotechnologies (Montreal, QC, Canada). The various linear synthetic
peptides and internally quenched fluorogenic (Q-) substrates reported in this article
are: (I) hproBDNF(50-63): KAGSRGLTSLADTF; (II) hSREBP-2(504-530):
GGAHDSDQHPHSGSGRSVLSFESGSGG; (III) hSKI-1(174-191):
WHATGRHSSRRLLRAIPR; (IV) hSKI-1(174-188+LE): WHATGRHSSRRLLRALE; (V)
hSKI-1(182-188+LE): SRRLLRALE; (VI) hSKI-1(156-172): WQSSRPLRRASLSLGSG;
(VII) hSKI-1(187-201): RAIPRQVAQTLQADV; (VIII) hSKI-1(128-136): PQRKVFRSL;
(IX) hSKI-1(128-142): PQRKVFRSLKYAESD; (X) Q-hSKI-1(132-142):
Abz-VFRSLKYAESD-Y(NO2)-A; (XI) Q-hSKI-1(134-142): Abz-RSLKYAESD-Y(NO2)-A.
Except for the first two peptides, which were purchased from the Sheldon
Biotechnology Institute (McGill University, QC, Canada), all other peptides were
synthesized with the carboxy-terminus in the amide form. Peptides III-XI were prepared
on a solid phase peptide synthesizer (Pioneer model, PE Biosystems) using either
2-(1H-benzotriazole-1-yl)-1,1,3,3-tetramethyluronium hexafluorophosphate (HBTU) /
N-hydroxybenzotriazole (HOBT) or HATU (O-[7-azabenzotriazol-1-yl]-N,N,N',N'-
tetramethyluronium hexafluorophosphate) / diisopropyl ethyl amine (DIEA)-mediated
Fmoc chemistry with PAL-PEG unloaded resin and the standard side chain protecting
groups (16). For the incorporation of the two unnatural amino acids [Abz and Y(NO2)],
an extended coupling cycle was used instead of either the standard or fast cycles.
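
Because each peptide above is named by the residue range it spans, the sequences can be checked mechanically against those ranges. The following sketch does this for peptides I-XI; the tuple layout and function names are hypothetical conveniences, and the Abz and Y(NO2)-A groups of peptides X and XI are left out because they are not part of the native residue range.

```python
# (label, source, first residue, last residue, appended residues, sequence)
PEPTIDES = [
    ("I", "hproBDNF", 50, 63, "", "KAGSRGLTSLADTF"),
    ("II", "hSREBP-2", 504, 530, "", "GGAHDSDQHPHSGSGRSVLSFESGSGG"),
    ("III", "hSKI-1", 174, 191, "", "WHATGRHSSRRLLRAIPR"),
    ("IV", "hSKI-1", 174, 188, "LE", "WHATGRHSSRRLLRALE"),
    ("V", "hSKI-1", 182, 188, "LE", "SRRLLRALE"),
    ("VI", "hSKI-1", 156, 172, "", "WQSSRPLRRASLSLGSG"),
    ("VII", "hSKI-1", 187, 201, "", "RAIPRQVAQTLQADV"),
    ("VIII", "hSKI-1", 128, 136, "", "PQRKVFRSL"),
    ("IX", "hSKI-1", 128, 142, "", "PQRKVFRSLKYAESD"),
    ("X", "hSKI-1", 132, 142, "", "VFRSLKYAESD"),
    ("XI", "hSKI-1", 134, 142, "", "RSLKYAESD"),
]


def expected_length(first: int, last: int, appended: str) -> int:
    """Number of residues implied by an inclusive range plus appended residues."""
    return last - first + 1 + len(appended)


def inconsistent_peptides(peptides) -> list:
    """Labels of peptides whose sequence disagrees with the stated range."""
    bad = []
    for label, _source, first, last, appended, seq in peptides:
        length_ok = len(seq) == expected_length(first, last, appended)
        suffix_ok = seq.endswith(appended)
        if not (length_ok and suffix_ok):
            bad.append(label)
    return bad


if __name__ == "__main__":
    problems = inconsistent_peptides(PEPTIDES)
    print("all ranges consistent" if not problems else f"check peptides: {problems}")
```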

Purification, Analysis, and Digestion of Peptide Substrates - The crude peptides
were purified by RP-HPLC using a semi-preparative CSC-Exsil C18 column (2.5 x 25
cm). With monitoring at 210 nm, the peptides were eluted with a 1 %/min linear gradient
(5 % to 60 %) of aqueous 0.1 % TFA/CH3CN at 2 ml/min. The peptide purity and
concentration were determined by quantitative amino acid analysis (16). The identity
of each purified peptide was confirmed by MALDI-TOF spectrometry using the matrix
α-cyano-4-hydroxycinnamic acid (Aldrich Chemical Co).

For digestions, each peptide was typically reacted at 37 °C with 10 µl of the
concentrated enzyme preparation in a buffer consisting of 50 mM HEPES
(N-2-hydroxyethylpiperazine-N'-2-ethanesulfonic acid) (ICN Biomedicals Inc), 50 mM MES
(2-[N-morpholino]ethanesulfonic acid) (Sigma Chem Co.), and 3 mM Ca2+-acetate (pH
6.5). The digestion products were separated by RP-HPLC on a Beckman 5 µm
Ultrasphere C18 column (0.2 x 25 cm) and eluted with a 1 %/min linear gradient of
aqueous 0.1 % TFA/CH3CN (5-45 %) at a flow rate of 1 ml/min. The collected peptides
were characterized by mass spectrometry and amino acid composition, which was also
used to quantitate the amount of various substrates and products. The digestions of
the quenched fluorogenic peptides were analyzed by RP-HPLC using a dual UV (210
nm) and fluorescence (excitation and emission wavelengths of 320 and 420 nm,
respectively) detector (Rainin).

pH Optimum, Calcium-Dependence and Inhibitor Profile - The protocols used
were essentially the same as reported previously (13). Stocks of the buffer described
above were adjusted to pH 5.0-8.5 at 0.5 unit increments by addition of either acetic
acid or sodium hydroxide. In order to investigate the calcium requirement of SKI-1,
increasing concentrations of Ca2+-acetate were used, ranging from 0 to 10 mM. For
inhibition studies, the enzyme in the reaction buffer was preincubated with the desired
agents for 30 min prior to addition of peptide II.

Km(app), Vmax(app) and Ki(app) determinations - Following digestion reactions with
increasing substrate concentrations, the products were separated by RP-HPLC. The
rate of substrate hydrolysis was obtained from the integrated peak areas of the
chromatograms. Km(app) and Vmax(app) values were estimated using nonlinear regression
analysis (Enzfitter software; Elsevier Biosoft, Cambridge, UK) of plots of the hydrolysis
rate vs the substrate concentration. For apparent inhibitor constant [Ki(app)]
determinations, variable inhibitor concentrations within the range of 15-70 % inhibition
were used at three concentrations of peptide IV ranging from 0.6 to 3.5 times the Km(app)
value. The Ki(app) values were estimated from Dixon plots as described (16). For the two
quenched peptides, kinetic parameters were determined as described (17).
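
To illustrate what a nonlinear regression of hydrolysis rate against substrate concentration involves, the sketch below fits the Michaelis-Menten equation v = Vmax·S / (Km + S) by least squares. It is not the Enzfitter procedure used in the text: it is a minimal stand-in that solves for the best Vmax in closed form at each trial Km and then searches over Km with a golden-section search, which assumes the error has a single minimum in the searched range. The data points are synthetic values in arbitrary units, generated for illustration only, and all names are hypothetical.

```python
import math


def fit_michaelis_menten(substrate, rate, km_max_factor=10.0, tol=1e-9):
    """Least-squares fit of v = Vmax*S/(Km+S); returns (vmax, km).

    Km is searched between ~0 and km_max_factor * max(substrate).
    """

    def best_vmax(km):
        # For a fixed Km the model is linear in Vmax, so Vmax has a closed form.
        f = [s / (km + s) for s in substrate]
        return sum(v * fi for v, fi in zip(rate, f)) / sum(fi * fi for fi in f)

    def sse(km):
        vmax = best_vmax(km)
        return sum((v - vmax * s / (km + s)) ** 2 for s, v in zip(substrate, rate))

    lo, hi = 1e-9, km_max_factor * max(substrate)
    inv_phi = (math.sqrt(5) - 1) / 2
    while hi - lo > tol:
        c = hi - inv_phi * (hi - lo)
        d = lo + inv_phi * (hi - lo)
        if sse(c) < sse(d):
            hi = d  # minimum lies in [lo, d]
        else:
            lo = c  # minimum lies in [c, hi]
    km = (lo + hi) / 2
    return best_vmax(km), km


if __name__ == "__main__":
    # Synthetic, noise-free example (arbitrary units): Vmax = 10, Km = 2.
    s_values = [0.5, 1.0, 2.0, 4.0, 8.0, 16.0]
    v_values = [10.0 * s / (2.0 + s) for s in s_values]
    vmax, km = fit_michaelis_menten(s_values, v_values)
    print(f"Vmax(app) ~ {vmax:.3f}, Km(app) ~ {km:.3f}, ratio ~ {vmax / km:.3f}")
```

With real chromatographic peak areas the fitted values would carry experimental error, and the ratio Vmax(app) / Km(app) is the quantity the text later uses to compare substrates.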

RESULTS 

SKI-1 Overexpression, Purification, Biosynthesis, and Prosegment Processing 
We have previously shown that overexpression of full-length SKI-1 (FL-SKI-1)
in HK293 cells results in shedding of a 98 kDa form (sSKI-1) of this enzyme into the
medium (10). Based on this finding, we engineered a soluble form of SKI-1
(BTMD-SKI-1), ending at residue 997, to which we added a hexa-His sequence at the
C-terminus (Fig. 19A). In a comparative biosynthetic analysis, shown in Fig. 19B, LoVo
cells were infected with the SKI-1 virus constructs VV:FL-SKI-1, VV:BTMD-SKI-1, and
wild type virus (VV:WT). After labeling the cells for 3 h with [35S]Cys, proteins in the
media were immunoprecipitated with an antiserum directed against either the
prosegment of SKI-1 (Ab:P) or an internal SKI-1 sequence (Ab:S). In both cases, a
protein of ~14 kDa that was not seen with VV:WT infections co-immunoprecipitated
with the 98 kDa sSKI-1 or the 100 kDa BTMD-SKI-1 (bSKI-1, Fig. 19B). Since Ab:P
was raised against a recombinant SKI-1 prosegment peptide and has been shown
previously to detect the SKI-1 zymogen (10), we concluded that the ~14 kDa peptide
is most likely derived from the cleaved prosegment (the full-length prosegment is ~24
kDa - see below). The fact that it co-immunoprecipitated with the enzyme under
denaturing conditions suggests a strong interaction between SKI-1 and this region of
its prosegment. The actual stoichiometry of enzyme-to-prosegment is not clear from
this experiment, since it was carried out using two different antisera and denaturing
conditions. We also observed that some of the 100 kDa BTMD-SKI-1 is cleaved into
a 98 kDa form similar to that found with FL-SKI-1 (Fig. 19B). This conversion is
presumably carried out by endogenous "shedding enzymes" (10,18) that can act on
both forms of SKI-1, although C-terminal sequencing would be needed to confirm this
hypothesis.

Western blot analyses of media obtained from BSC40 cells infected with
VV:BTMD-SKI-1 also revealed a secreted ~100 kDa immunoreactive band (Fig. 19C).
The same band was detected using either an antiserum against the N-terminal region
of the SKI-1 catalytic domain (Ab:N) or one against a more C-terminal region (Ab:S).
When Ab:P was mixed together with Ab:S and used to probe the metal affinity
column-purified SKI-1 preparation (indicated by the * in Fig. 19C), we were again able
to detect the ~14 kDa prosegment fragment, further supporting our hypothesis that it
forms a strong association with the enzyme. (It should be noted that although a mixture
of Ab:S and Ab:P was used in order to detect both proSKI-1 and BTMD-SKI-1
simultaneously, when either Ab:N or Ab:S was used alone, only the 100 kDa or 14 kDa
species was observed, respectively (not shown).)

In order to evaluate the rate of zymogen processing and the fate of the
prosegment, LoVo cells overexpressing VV:FL-SKI-1 were pulse-labeled with [3H]Leu
for 15 min and then chased for 2 h. Figure 20 shows an SDS-PAGE analysis of the cell
lysates immunoprecipitated with Ab:P (left panel). At least five immunoreactive
polypeptides (molecular masses of ~26, 24, 14, 10 and 8 kDa), which were not present
in controls infected with VV:WT, were detected. In order to further define in which
organelle(s) this processing occurred, LoVo cells infected with VV:FL-SKI-1 were
pulse-labeled with [3H]Leu for 2 h in the presence or absence of BFA (Fig. 20, right
panel). In both cases, the same five major, intracellular, immunoreactive prosegment
forms could still be detected. Since the fungal metabolite BFA is known to disassemble
the Golgi complex and cause the ER to fuse with the cis, medial and trans Golgi (but
not the trans Golgi network, TGN) (19), this result strongly implies that the initial
zymogen processing of proSKI-1 occurs early along the secretory pathway. Possible
locations include the ER or cis Golgi, as was previously reported (10). Moreover,
further processing of the prosegment into yet smaller fragments also occurs in these
organelles.

To further characterize the prosegment of SKI-1, we took advantage of a stable
transfectant of FL-SKI-1 in human HK293 cells that we had made previously (10). This
system has the added advantage that the possibility of VV overexpression artifacts
influencing the processing of the prosegment is eliminated. Concentrated culture
medium from these cells (serum-free) was purified via RP-HPLC using first a
semi-preparative C4 column (not shown) followed by an analytical C4 column (Fig. 21A).
The eluted fractions were analyzed by Western blot using Ab:P (Fig. 21B).
Immunoreactive peptides ranging from ~4.5 to 24 kDa were apparent. N-terminal
sequencing of the very abundant ~14 kDa protein in fraction IV (Fig. 21C) revealed a
major sequence starting at Gly18 of pre-proSKI-1 (10,12). This clearly defines the signal
peptidase cleavage site as LWLLC17↓GKKHLG, which is one aa before that predicted
by signal peptidase cleavage site algorithms (10,11). The N-terminal sequence of the
~4.5 kDa polypeptide (Fig. 21D) revealed that it starts at Pro143, indicating a cleavage
at the sequence KYAESD142↓PTVPCNETRWSQK. This fragment is most likely the
product of cleavage between Asp and Pro that may be caused by the acidic conditions
encountered in either RP-HPLC, Edman sequencing (20), or sample preparation for
SDS-PAGE analysis (21). An unexpected benefit of this cleavage was our finding that
phenylthiohydantoin (PTH)-Asn148, which occurs in the putative N-glycosylation site
AsnGluThr, was readily detected in this sequence. Thus, the predicted N-glycosylation
site Asn148 within the prosegment of SKI-1 is not employed, at least in this expression
system. This conclusion was also supported by the prosegment's resistance to endo
H and endo F digestion (not shown). Of the two eukaryotic subtilases known to contain
a potential N-glycosylation AsnGluThr site, i.e. kexin (22) and SKI-1 (10), it appears
that at least the latter's prosegment is not N-glycosylated. Finally, the separation of
the above prosegment fragments from mature SKI-1 using RP-HPLC (Fig. 21A,B) and
non-reducing SDS-PAGE (not shown) suggests that none of the Cys residues in the
prosegment (10) are linked by disulfide bridges to the rest of the enzyme.

As a preliminary means of characterizing the SKI-1 prosegment fragments,
MALDI-TOF analysis (Fig. 21E) of fraction IV from Fig. 21B was carried out. Three major
molecular ions of masses 13,351, 13,518, and 13,685 Da were detected, with an
expected error of ± 25 Da for this mass range. Combined with the previous N-terminal
sequencing results of the ~14 kDa peptide (Fig. 21C), these mass values indicate that
this peptide has heterogeneous C-termini that are derived from cleavages near the
sequence RKVFRSLK137, as indicated in Fig. 21E. In fact, this region contains three
potential SKI-1 cleavage sites (8) with an R or K at the P4 position and either an F, R
or K at the P1 position. Although the calculated molecular masses of 13,339, 13,496
and 13,696 for the polypeptides G17KK...RKVF133, G17KK...RKVFR134 and
G17KK...RKVFRSL136, respectively, match within experimental error (± 22 Da) the
observed masses in Fig. 21E, these assignments should only be taken as a first
indication (see below). Moreover, the predicted G17KK...RKVFRSL136 fragment does not
correspond to the expected SKI-1 cleavage motif of a basic residue at the P4 position.
Hence, this secreted peptide could result either from cleavage at G17KK...RKVFRSL136
or, more likely, at G17KK...RKVFRSLK137 followed by basic carboxypeptidase cleavage
of the C-terminal Lys137 (23). Since we were unable to obtain consistent mass spectra of
the ~4.5 kDa polypeptide that was sequenced in Fig. 21D, we could not use this
technique to approximate its C-terminus, which presumably corresponds to the
C-terminus of the processed SKI-1 prosegment. We therefore resorted to synthetic
peptide cleavage as a tool to accurately define potential prosegment cleavage sites.
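
The mass comparison in the preceding paragraph can be re-traced numerically. The sketch below pairs the observed and calculated masses in ascending order (an assumption of this illustration, though it is the pairing the text implies), checks each difference against the stated ± 25 Da error, and compares the steps between calculated masses with standard average residue masses for Arg, Ser and Leu. The residue masses are textbook values rather than figures taken from the text.

```python
OBSERVED_DA = (13351, 13518, 13685)
CALCULATED_DA = (
    ("G17KK...RKVF133", 13339),
    ("G17KK...RKVFR134", 13496),
    ("G17KK...RKVFRSL136", 13696),
)
TOLERANCE_DA = 25

# Standard average residue masses in Da (not given in the text).
AVG_RESIDUE_MASS = {"R": 156.19, "S": 87.08, "L": 113.16}


def mass_differences():
    """Observed minus calculated mass for each assignment, in ascending order."""
    return [obs - calc for obs, (_name, calc) in zip(OBSERVED_DA, CALCULATED_DA)]


def stepwise_increments():
    """Compare steps between calculated masses with the added residues."""
    added = ("R", "SL")  # RKVF -> RKVFR adds Arg; RKVFR -> RKVFRSL adds Ser + Leu
    steps = []
    for i, residues in enumerate(added):
        calc_step = CALCULATED_DA[i + 1][1] - CALCULATED_DA[i][1]
        residue_step = sum(AVG_RESIDUE_MASS[r] for r in residues)
        steps.append((residues, calc_step, round(residue_step, 2)))
    return steps


if __name__ == "__main__":
    for (name, _calc), diff in zip(CALCULATED_DA, mass_differences()):
        status = "within" if abs(diff) <= TOLERANCE_DA else "outside"
        print(f"{name}: observed - calculated = {diff:+d} Da ({status} tolerance)")
    for residues, calc_step, residue_step in stepwise_increments():
        print(f"adding {residues}: calculated step {calc_step} Da, residues {residue_step} Da")
```

The largest difference is 22 Da, which matches the experimental error quoted in the text, and the steps between the calculated masses agree with the added residues to within about 1 Da.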

Analysis of Synthetic Prosegment-derived Peptide Cleavages - Based on our
detection of ~26 and 24 kDa SKI-1 prosegment products (Fig. 20), as well as on a
mutagenesis study of SREBP-2 cleavage sites (8), we synthesized three SKI-1
prosegment peptides encompassing potential, C-terminal, autocatalytic cleavage sites
(10,11). All contain Arg at P4 and either Leu, Lys, Ala or Phe at P1 (peptides III, VI and
VII shown in Table II-A). Of these peptides containing only native sequences, the only
one with detectable cleavage by SKI-1-containing concentrated medium (from either
VV:BTMD-SKI-1-infected BSC40 cells or SKI-1-transfected HK293 cells) was peptide
III (WHATGRHSSRRLL186↓RAIPR) (see Table II-A). No cleavages were observed
when VV:WT-infected or empty vector-transfected media were used (not shown).
Metal chelation chromatography-purified enzyme further supported that this cleavage
is effected by SKI-1 (Fig. 22A; peptide III), and the products were positively identified
via mass spectrometry.

Similarly, based on the mass spectrometry data in Fig. 21E, we synthesized two
peptides (VIII and IX) encompassing the putative internal processing site(s) of the
SKI-1 prosegment. Both were cleaved at multiple locations by SKI-1-containing
concentrated medium from HK293 transfectants (not shown). Further analysis revealed
that one of these cleavages, corresponding to PQRKVF133↓RSL, was as prevalent in
empty vector-transfected HK293 medium as in SKI-1-transfected medium (see Table
III-A, peptide VIII). In contrast, the PQRKVFRSLK137↓YAESD cleavage was only seen
in SKI-1-containing medium. This cleavage was also confirmed using metal chelation
chromatography-purified enzyme (Fig. 22B; peptide IX) and mass spectrometry to
identify the products. However, the PQRKVF133↓RSLKYAESD cleavage products are
also clearly visible. We acknowledge that there could be residual contaminating
proteases in our purified SKI-1 preparations (minor bands were visible on colloidal
gold-stained membranes of SKI-1 preparations). Thus, while we are confident that
SKI-1 cleaves its prosegment at the C-terminal WHATGRHSSRRLL186↓RAIPR site and at
the internal PQRKVFRSLK137↓YAESD site, our data do not allow us to rule out
SKI-1-mediated cleavage at the PQRKVF133↓RSLKYAESD site.
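
The candidate internal sites discussed above can be enumerated directly from the sequence of peptide IX using the motif stated earlier (R or K at P4 and F, R or K at P1). In the sketch below, P4 is taken as the residue three positions N-terminal to P1, following the usual subsite numbering; the function name and defaults are hypothetical, and a match only flags a candidate site, not a demonstrated cleavage.

```python
def candidate_sites(seq, first_residue, p4_allowed="RK", p1_allowed="FRK"):
    """Return (P1 residue number, P4 residue, P1 residue) for each motif match.

    The scissile bond would lie immediately C-terminal to the P1 residue.
    """
    sites = []
    for i, p1 in enumerate(seq):
        if i < 3:
            continue  # no P4 residue available inside this peptide
        p4 = seq[i - 3]
        if p1 in p1_allowed and p4 in p4_allowed:
            sites.append((first_residue + i, p4, p1))
    return sites


if __name__ == "__main__":
    # Peptide IX, hSKI-1(128-142), as listed in the text.
    for number, p4, p1 in candidate_sites("PQRKVFRSLKYAESD", 128):
        print(f"P1 = {p1}{number} (P4 = {p4}{number - 3})")
```

For peptide IX this yields P1 positions 133, 134 and 137, in line with the three potential sites mentioned in the text.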

Comparing the simple cleavage rates of the SKI-1 prosegment internal and
C-terminal sites, we observed that the former was vastly superior to the latter (not
shown). We also noticed that the peptides best processed by SKI-1 contain an acidic
residue at the P3' or P4' substrate site, whereas those that did not appeared to be
cleaved poorly or not at all (Table III-A). Moreover, we had previously established that
SKI-1 does not cleave the fluorogenic peptides RGLT-MCA, RGLTT-MCA and
RSVL-MCA (10), which lack P' residues. Based on these observations, we asked if
replacing the Ile and Pro residues at P3' and P4' of the C-terminal prosegment
processing site would significantly improve the SKI-1-mediated cleavage of peptide III.
Thus, we synthesized two mutants of this peptide (peptides IV and V, the latter
truncated by 8 aa at the N-terminus) in which the Ile and Pro residues at P3' and P4'
were replaced by Leu and Glu, respectively. As shown in Table II-B, this change
significantly improved the processing of these peptides, such that we were able to
determine Vmax(app) / Km(app) values. The approximately two-fold difference in these
values for peptides IV and V further suggests that determinants N-terminal to the P4
position may also play a role in substrate specificity. The SKI-1 specificity of these
peptide cleavages was also verified using metal chelation chromatography-purified
enzyme (when VV:WT-infected or empty vector-transfected media were used, no peptide
processing was observed).

In Vitro Kinetic Properties of SKI-1: Comparative Analysis of Synthetic Peptide
Cleavages - In a previous report (10), sSKI-1 was shown to cleave the 32 kDa
proBDNF into a 28 kDa form at the RGLT↓SL sequence in vitro with a pH optimum
close to neutrality. Similar to PCs (1-3), we suggested that SKI-1 might be a
Ca2+-dependent enzyme since the calcium ionophore A23187 inhibited the ex vivo
cleavage of proBDNF (10). In order to obtain kinetic analyses of defined SKI-1
substrates, we examined a 14 aa peptide spanning the hproBDNF processing site (10),
K50AGSRGLT↓SLADTF63 (peptide I), and a 27 aa hSREBP-2-related peptide (8),
G504GAHDSDQHPHSGSGRSVL↓SFESGSGG530 (peptide II). Concentrated
SKI-1-containing medium (from either VV:BTMD-SKI-1-infected BSC40 cells or
SKI-1-transfected HK293 cells) was reacted with these peptides at pH 6.5, followed by
MALDI-TOF mass spectrometry analysis of the RP-HPLC-purified products. The
expected cleavages were confirmed and did not occur using WT/empty vector-derived
media (Fig. 23). Again, the metal chelation chromatography-purified enzyme generated
the same products as the concentrated media (not shown). We then demonstrated that
the optimal pH and calcium concentrations for efficient cleavage of the hSREBP-2
peptide (II) are pH 6.5 and 2 mM Ca2+, respectively (Fig. 24). Interestingly, the pH
optimum observed with the proBDNF peptide (I) is sharper than that obtained with
peptide II. In the former case, the enzyme still retains about 30 % of its activity at pH
5.0 and 55 % of its activity at pH 8.5 (Fig. 24A). Similar results for the pH optimum of
peptide II cleavage were obtained with metal chelation-purified BTMD-SKI-1 (not
shown). In contrast, however, the pH optimum of peptide IX with the purified enzyme
was 8.0, with no activity detectable below pH 5.5.

A summary of the kinetic analyses of the synthetic proBDNF (peptide I) and SREBP-2
(peptide II) cleavages by SKI-1 is shown in Table II-B. Both peptides are cleaved at
comparable kinetic efficiencies, with Vmax(app)/Km(app) values of 0.002 and 0.004 h⁻¹,
respectively. In comparison, the Vmax(app)/Km(app) value estimated with peptide IV is
5-10-fold higher than those obtained with peptides I and II (Table II-B). The
N-terminal truncation of peptide IV from 17 to 9 aa (peptide V, Table II-A) caused a
4-fold reduction in catalytic efficiency (Table II-B).
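
The efficiencies quoted above follow directly from the apparent constants. As a minimal sketch (the dictionary and function names are illustrative and are not part of the original work), the ratio Vmax(app)/Km(app) can be recomputed from the Table II-B entries, in the table's own units:

```python
# Apparent kinetic constants as listed in Table II-B: (Km(app), Vmax(app)).
# Names below are illustrative assumptions, not taken from the source.
TABLE_IIB = {
    "I": (169.0, 0.4),
    "II": (124.0, 0.5),
    "IV": (17.0, 0.4),
    "V": (109.0, 1.1),
}


def efficiency(km_app, vmax_app):
    """Return the apparent catalytic efficiency Vmax(app)/Km(app)."""
    return vmax_app / km_app


def fold_change(reference, other):
    """Return how many times more efficient `reference` is than `other`."""
    return efficiency(*TABLE_IIB[reference]) / efficiency(*TABLE_IIB[other])


if __name__ == "__main__":
    for peptide, (km, vmax) in TABLE_IIB.items():
        print(f"peptide {peptide}: Vmax/Km = {efficiency(km, vmax):.4f}")
    print(f"IV relative to I: {fold_change('IV', 'I'):.1f}-fold")
```

Because the tabulated constants are rounded, ratios recomputed this way may differ slightly from the tabulated ratios.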

Table III shows the inhibitor profile of SKI-1: the enzyme is quite sensitive to the
metal chelator EDTA and to the calcium chelator EGTA. In addition, the transition
metals Cu2+ and Zn2+, but not Ni2+ or Co2+, inhibit the enzyme at mM concentrations.
As reported using the 32 kDa proBDNF (10), assays with the synthetic SREBP-2 peptide
demonstrated that the metal chelator o-phenanthroline becomes inhibitory at
concentrations above 1 mM. The other non-chelator inhibitors tested had minimal or no
effects on SKI-1 activity.

In order to develop a convenient in vitro assay for SKI-1, we designed a number of
internally quenched fluorogenic substrates and tested their cleavage efficacy by
SKI-1. The two best peptides encompassed the processing site RSLK↓ within the hSKI-1
prosegment (peptides X and XI, Table II-A). Mass spectrometric analysis confirmed
that both peptides were cleaved at the RSLK↓ site by shed SKI-1 derived from HK293
cell transfectants, but not by medium obtained from HK293 empty vector transfectants.
This processing generated the fluorescent N-terminal peptides Abz-VFRSLK or Abz-RSLK,
and a non-fluorescent C-terminal peptide YAESDY(NO2)-A (not shown). Measurements of
kinetic parameters demonstrated that peptides X and XI are about 3- and 16-fold better
substrates than the C-terminal prosegment peptide IV (Tables II-B and IV), suggesting
that the shorter peptide XI may be the best SKI-1 substrate tested to date. This
cleavage was completely abolished in the presence of 10 mM EDTA, in agreement with the
Ca2+-dependence of SKI-1 activity (Fig. 24B).

SKI-1 Inhibition by its Prosegment - One important question remaining is
whether the SKI-1 prosegment functions as an inhibitor of its enzymatic activity,
analogous to the prosegments of other subtilases (3). We thus prepared prosegment
constructs ending near the proposed C-terminal processing site RRLL186 (Fig. 22A),
designated PS1, extending to Leu169; PS2, extending to Ala188; and PS3, extending to
Leu197. To each C-terminus we coupled a hexa-His tag. These prosegment constructs
were expressed in bacteria and purified by Ni2+-chelation chromatography followed by
RP-HPLC (see Experimental Procedures). The purity of these prosegments was
confirmed by SDS-PAGE/Coomassie staining and aa analysis (not shown). A summary
of the inhibitory potency of each prosegment using peptide IV as a substrate is shown
in Table V. Kinetic analysis using Dixon plots (15) indicated a competitive inhibition
mechanism (not shown). Although PS2 exhibits the best apparent inhibitory constant
(Ki(app) = 97 nM), PS3 (Ki(app) = 127 nM) and PS1 (Ki(app) = 182 nM) are similarly
potent SKI-1 inhibitors. When PS2 was digested with carboxypeptidase B to eliminate
the His-tag, its inhibitory potency was not affected (not shown), confirming that this
tag is not responsible for the observed inhibition. We also tested the inhibitory
activity of the RP-HPLC-fractionated native prosegment (see Fig. 21). Only the
material from fraction IV, which included the full-length ~24 kDa prosegment, was
inhibitory, whereas the other fractions, including the ~14 kDa peptide alone or in
combination with smaller fragments, were not inhibitory (not shown).
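
For a competitive inhibitor, plots of 1/v against inhibitor concentration at two substrate concentrations are straight lines that cross at [I] = -Ki. The following is a minimal sketch of that Dixon-style estimate, not the analysis actually performed: the Km, Vmax, substrate and inhibitor concentrations are hypothetical, the rates are simulated rather than measured, and only the Ki of 97.5 nM is borrowed from Table V (PS2).

```python
# Dixon analysis sketch for competitive inhibition (standard library only).
# All inputs except Ki = 97.5 nM (PS2, Table V) are hypothetical.


def rate(s, i, km, vmax, ki):
    """Michaelis-Menten rate in the presence of a competitive inhibitor."""
    return vmax * s / (km * (1.0 + i / ki) + s)


def fit_line(xs, ys):
    """Ordinary least-squares fit; returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def dixon_ki(inhibitor_concs, rates_s1, rates_s2):
    """Estimate Ki from the crossing point of two Dixon lines (1/v vs [I])."""
    a1, b1 = fit_line(inhibitor_concs, [1.0 / v for v in rates_s1])
    a2, b2 = fit_line(inhibitor_concs, [1.0 / v for v in rates_s2])
    x_cross = (b2 - b1) / (a1 - a2)  # the two lines cross at [I] = -Ki
    return -x_cross


if __name__ == "__main__":
    km, vmax, ki = 17.0, 0.4, 97.5  # hypothetical Km and Vmax; Ki in nM
    inhibitor = [0.0, 50.0, 100.0, 200.0, 400.0]  # nM, hypothetical
    v_low = [rate(10.0, i, km, vmax, ki) for i in inhibitor]
    v_high = [rate(40.0, i, km, vmax, ki) for i in inhibitor]
    print(f"Ki(app) = {dixon_ki(inhibitor, v_low, v_high):.1f} nM")
```

With real data the lines would be fitted to noisy measurements, so replicate experiments would be needed to put an error on the crossing point.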

DISCUSSION 

Limited proteolysis of inactive precursor proteins at sites marked by paired or
multiple basic residues is a widespread process (1,2). Less common is the recent
finding that bioactive peptides or proteins can also be generated by limited proteolysis
after either hydrophobic or small residues (3). SKI-1 represents the first mammalian
member of subtilisin-like processing enzymes with such substrate specificity (10,11).
It is a widely expressed enzyme (10) that may play a crucial role in cholesterol and
fatty acid metabolism (11). Due to its very recent discovery, its enzymatic
properties, substrate specificity, and the function of its proregion have only
begun to be addressed.

Many peptidyl hydrolases, including subtilases, possess a prodomain which
acts both as an intramolecular chaperone and a highly potent inhibitor of its associated
protease (24,25). Activation of the enzyme typically requires release of the prosegment
in an organelle-specific manner. For furin (26) the release occurs in the TGN, whereas
for PC1 and PC2 (27) it occurs in immature secretory granules. The data presented
in this report demonstrate that SKI-1 is unique among the mammalian subtilases, since
both the C-terminal and internal cleavages of its prosegment occur in the ER. Hence,
this enzyme does not appear to require an acidic environment for activation, assuming,
by analogy with other subtilases (3), that prosegment release is the crucial step leading
to zymogen activation. We propose the following sequence of events presumably
leading to SKI-1 activation: 1) The signal peptide is removed in the ER by a signal
peptidase cleavage at LVVLLC17↓GKKHLG (Fig. 21C). 2) The prosegment is
processed into a non-N-glycosylated polypeptide with an apparent molecular mass of
~24-26 kDa (Fig. 20). 3) This prosegment is further processed into 14, 10 and 8 kDa
intermediates (Fig. 20). While these multiple cleavages may be catalyzed by SKI-1
itself, the participation of other proteases cannot be excluded. The major cleavages
leading to the formation of the ~24 and ~14 kDa products occur within 10 min, and the
other secondary ones within 30 min (not shown). Since treatment of cells with BFA did
not significantly alter these processing events, they most likely occur in the ER (Fig.
20). It is possible that the generation of prosegment fragments from the ~24-26 kDa
pro-form leads to a loss of inhibition in a fashion similar to that of subtilisin E (24,25).
Indeed, our results demonstrate that while the full-length prosegment is inhibitory, its
~14 kDa product is not. Surprisingly, some pro-region-derived polypeptides are found
associated with SKI-1 in cell culture media. Thus, in contrast to furin (26), the low pH
and high Ca2+ concentrations prevailing in the TGN do not lead to propeptide
dissociation. High ionic concentrations (up to 1 M NaCl) such as those used in
immunoprecipitation (Fig. 19B) and metal chelation protein purification (Fig. 19C) also
do not disrupt the complex. It is only during RP-HPLC purification (Fig. 21A), in the
presence of strong acids and organic solvents, that the prosegment peptides
dissociate from SKI-1. These data suggest that hydrophobic interactions may be
critical, as is the case for subtilisin (24,25).
To distinguish the SKI-1 prosegment autoprocessing sites (C-terminal and internal)
from several closely situated candidate sites, we employed a combination of mass
spectrometry and synthetic peptide digestion. In the case of the C-terminal site, only
one of three candidate peptides (III) was processed by SKI-1 (Table II-A), indicating
that RRLL186↓RAIP is the most likely autoprocessing site. For the internal site,
preliminary mass spectrometric data suggested three distinct cleavages occurring
within the sequence PQRKVFRSLKYAESD142 (Fig. 21E). Two of the three possible
sites (PQRKVF133↓RSLKYAESD and PQRKVFR134↓SLKYAESD) appeared to satisfy
the proposed SKI-1 recognition motif requiring a P4 basic residue (8). The third
possibility (PQRKVFRSL136↓KYAESD) could be considered by assuming the cleavage
actually occurred at PQRKVFRSLK137↓YAESD, followed by endogenous, basic
carboxypeptidase removal of the C-terminal Lys residue (23). Assays carried out in
vitro with synthetic peptides corresponding to this region of the prosegment (peptides
VIII and IX) produced the same cleavage products (not shown), but only the
PQRKVFRSLK137↓YAESD cleavage was unique to SKI-1. Thus, we propose that the
aforementioned site is the most likely internal autoprocessing site, with the qualification
that cleavage at PQRKVF133↓RSLKYAESD may occur to a lesser extent (see Results and Fig. 22).

Other information regarding the substrate preferences of SKI-1 was obtained by
replacing the P3' and P4' Ile and Pro residues of the C-terminal cleavage site peptide
(III) by Leu and Glu (peptides IV and V) to create a very well processed SKI-1
substrate. While it would appear that the presence of an acidic residue at P4'
significantly enhances the rate of substrate hydrolysis, it is also possible that the
presence of Pro at P4' hinders efficient substrate processing. The presence of similar
acidic residues at the P3' or P4' position of the two confirmed substrates of SKI-1
(peptides I and II) as well as in the prosegment internal cleavage site RSLK137↓YAES
(Table II-A) lends support to the first argument. In addition to these residues, others
also appear to play a role in SKI-1 substrate cleavage catalysis. The peptide pairs IV/V
and X/XI both point to influences of positions N-terminal to the P4 residue.
Interestingly, the efficiency of the truncated C-terminal peptide V is lower than that of
peptide IV, whereas that of the truncated internal (quenched) peptide XI is higher.
Taken together, these data indicate the importance of aa at both the P and P' positions
in SKI-1-mediated substrate hydrolysis.

The data presented in Fig. 24 indicate that SKI-1 functions most efficiently near neutral
pH and at 2-3 mM Ca2+. This is in general agreement with the conditions that
reportedly prevail in the ER (28,29). However, closer examination of the data reveals
that the pH optimum of SREBP-2 cleavage (peptide II, Fig. 24A) is actually 6.5, an
observation that we confirmed using our purified SKI-1 preparation (not shown). This
suggests that the processing of SREBP might occur outside of the ER, perhaps in the
Golgi where pH values of ~6.5 have recently been reported (30,31). Indeed, there is
now cellular evidence suggesting that SREBP cleavage may occur in the Golgi rather
than in the ER (32,33). The pH optimum of SKI-1 appears to be dependent on the
substrate employed; proBDNF (10) and its related peptide (I) appear to be well
cleaved even at pH 5.5, suggesting that SKI-1 could cleave this (and possibly other
substrates) in acidic endosome-like compartments where it was previously localized
(10). On the other hand, cleavage of the internal, autocatalytic, prosegment processing
site PQRKVFRSLK137↓YAESD (Fig. 22B) is optimal at pH 8 (not shown), implying that
this event, as we concluded from our biosynthesis assays, takes place most effectively
in the ER. Overall, the pH and Ca2+ profiles of SKI-1 resemble those of the
constitutively secreted PCs (1,13). The inhibitor profile of SKI-1 (10, Table III), showing
that enzymatic activity is significantly inhibited by EDTA, EGTA and only high
concentrations of o-phenanthroline, tends to discount the likelihood that SKI-1 is a
transition metal-dependent proteinase. In fact, SKI-1 activity is inhibited by low
concentrations of certain transition metals, such as Cu2+ and Zn2+.

Directed by the observation that peptides containing the primary processing site
of the prosegment of PC1 are potent inhibitors of its activity, and that the C-terminal
basic residues of furin and PC7 are essential for enzyme inhibition (34,35), we
assessed the inhibitory potency of three SKI-1 recombinant propeptides. All of these
end at sequences near the RRLL186↓RA cleavage site. Interestingly, the three
prosegments displayed comparable inhibitory potencies (Table V). Compared to
proPC1 (34), pro-furin and proPC7 (35), the Ki(app) values (Table V) are up to 250-fold
higher. This suggests that the prosegment of SKI-1, although potentially inhibitory in
vivo, may function more as a chaperone, catalyzing the productive folding of SKI-1.
Indeed, since SKI-1 may be active in the ER (10,11), whereas the PCs are not (13,26),
the lower inhibitory potency of the prosegment of SKI-1 may be adapted to the
conditions prevailing in this cellular compartment. In the case of PCs, highly effective
inhibition by the prosegment may be needed in order to ensure that these enzymes are
activated only when they reach the TGN or secretory granules (1-3). The 14 kDa
fragment, which represents the major secreted form of the prosegment, is tightly
associated with SKI-1 (Fig. 19C) yet it is not inhibitory (not shown). Accordingly, this
segment may serve a chaperonin-like function similar to that reported for the
N-terminal 150 aa of 7B2 towards proPC2 (36,37).

Two articles describing the processing, purification and in vitro activity of
hamster SKI-1/S1P were recently published (38,39). On most points, our results are in
close agreement with theirs. These authors characterized the processing of the
SKI-1/S1P prosegment, proposing that the ER is the major site of autocatalytic
activation of SKI-1 at the same cleavage sites as we present here. They also went on
to purify a soluble form of the enzyme, showing that it correctly processes
SREBP-2-derived peptides as well as a 16-residue peptide spanning the internal
prosegment cleavage site. In addition, they find that cleavage of a fluorogenic
RSLK-MCA peptide derived from the same sequence is optimal at ~3 mM Ca2+ and at
slightly alkaline pH. Discrepancies such as the lack of detectable shed SKI-1/S1P,
multiple secreted prosegment forms, and a different signal peptidase site can most
likely be attributed to the different cell types and species employed in the two studies.

In conclusion, the present work firmly establishes that SKI-1 is a Ca2+-dependent
subtilase with a reasonably neutral pH optimum, depending on the substrate employed.
We also demonstrate that SKI-1 can cleave substrates C-terminal
to Thr, Leu and Lys residues, thus providing direct, in vitro evidence that it is a
candidate converting enzyme responsible for the generation of 28 kDa proBDNF (10)
and SREBP-2 processing at site 1 (11). For efficient cleavage, it appears that
substrates should contain a basic residue at P4 and an aliphatic one at P2 (Table II-A).
Furthermore, aa at the P3' and P4' positions seem to exert an important discriminatory
effect. The best substrate tested so far is the quenched fluorogenic substrate
Abz-RSLK↓YAESDY(NO2), which provides a convenient and sensitive assay for SKI-1
activity. The present data demonstrate that only the full-length SKI-1 prosegment is
inhibitory. Thus, overexpression of this prosegment in cell lines may provide a novel
method for inhibiting the cellular activity of this enzyme in a fashion similar to that
of over-expressed profurin and proPC7 (35). Finally, it is anticipated that precursor
substrates other than the sterol-regulating SREBPs (8) and the neurotrophin proBDNF
(10) will be identified, thereby extending the spectrum of activity of this unique and
versatile enzyme.
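
The proposed recognition motif (basic residue at P4, aliphatic residue at P2, with an acidic residue at P3' or P4' favouring cleavage) can be checked against the substrates of Table II-A. The sketch below is illustrative only: the residue groupings are the usual textbook classes and are an assumption here, the function names are hypothetical, and the nitro group of Tyr(NO2) in peptide XI is ignored.

```python
# Substrates from Table II-A written as (P side, P' side) around the
# scissile bond. Residue classes are conventional groupings (assumption).
BASIC = set("RKH")
ALIPHATIC = set("AVLI")
ACIDIC = set("DE")

SUBSTRATES = {
    "I": ("KAGSRGLT", "SLADTF"),
    "II": ("GGAHDSDQHPHSGSGRSVL", "SFESGSGG"),
    "III": ("WHATGRHSSRRLL", "RAIPR"),
    "IV": ("WHATGRHSSRRLL", "RALE"),
    "XI": ("RSLK", "YAESDY"),
}


def p_site(n_side, k):
    """Residue at position Pk, counting back from the scissile bond."""
    return n_side[-k]


def p_prime_site(c_side, k):
    """Residue at position Pk', counting forward from the scissile bond."""
    return c_side[k - 1]


def core_motif(n_side):
    """True if P4 is basic and P2 is aliphatic."""
    return p_site(n_side, 4) in BASIC and p_site(n_side, 2) in ALIPHATIC


def has_acidic_prime(c_side):
    """True if an acidic residue occupies P3' or P4'."""
    return any(p_prime_site(c_side, k) in ACIDIC for k in (3, 4))


if __name__ == "__main__":
    for name, (n_side, c_side) in SUBSTRATES.items():
        print(name, p_site(n_side, 4), p_site(n_side, 2),
              core_motif(n_side), has_acidic_prime(c_side))
```

In this check, peptide III satisfies the P4/P2 requirement but carries Ile and Pro at P3' and P4', whereas peptides I, II, IV and XI carry an acidic residue at one of these positions.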
Table II-A

Synthetic peptide substrates

Peptides were first reacted with approximately equal quantities of BTMD-SKI-1
medium for 2-18 h as described in "Experimental Procedures". When cleavage was not
detected, a 10-fold concentrated enzyme preparation was tested. Arrow thickness is
a qualitative estimate of the cleavage efficacy.

Peptide     Sequence (↓ marks the cleavage site between P1 and P1')

I           KAGSRGLT↓SLADTF
II          GGAHDSDQHPHSGSGRSVL↓SFESGSGG
III         WHATGRHSSRRLL↓RAIPR
IV          WHATGRHSSRRLL↓RALE
V           SRRLL↓RALE
VI ¹        WQSSRPLRRASLSLGSG
VII ¹       RAIPRQVAQTLQADV
VIII ²      PQRKVF-RSL
IX ²,³      PQRKVFRSLK↓YAESD
X           Abz-VFRSLK↓YAESDY(NO2)-A
XI          Abz-RSLK↓YAESDY(NO2)-A

¹ No cleavage detected even with a 10-fold excess of enzyme.
² Cleavage detected but not attributable to SKI-1.
³ Kinetic determinations of this peptide were not attempted due to the presence of
multiple cleavages.

Table II-B

Kinetic constants for the hydrolysis of peptide substrates by BTMD-hSKI-1

Increasing concentrations of peptides were reacted with identical quantities of
BTMD-SKI-1 medium for times chosen to produce 5-30% substrate hydrolysis. Data
analysis was carried out as described in "Experimental Procedures". The values are
averages of duplicate assays.

Peptide     Km(app)     Vmax(app)     Vmax(app)/Km(app)
            (µM)        (nmol/h)      (h⁻¹ L⁻¹)

I           169         0.4           0.002
II          124         0.5           0.004
IV          17          0.4           0.023
V           109         1.1           0.010



WO 00/26348    PCT/CA99/01058



Table III

Effect of selected protease inhibitors on BTMD-hSKI-1 activity

Digestion reactions using BTMD-SKI-1 medium plus peptide II were carried out
as described in "Experimental Procedures". The agents were preincubated with the
enzyme for 30 min.

Inhibitor            Concentration     Hydrolysis of SREBP-2 peptide
                     (mM)              (% of control) ¹

Control              -                 100
APMSF                1.0               95
PMSF                 1.0               85
TPCK                 1.0               71
TLCK                 1.0               100
SBTI                 0.5 ²             100
Cystatin             0.01              100
Antipain             1.0               100
Chymostatin          1.0               100
Leupeptin            1.0               100
Pepstatin            0.1               97
E-64                 0.01              100
o-Phenanthroline     0.05              135
                     1.0               90
                     5.0               0
EDTA                 10.0              0
EGTA                 10.0              15
Dithiothreitol       10.0              92
CuSO4                1.0               0
ZnSO4                1.0               0
NiSO4                1.0               93
MgCl2                1.0               100
CoCl2                1.0               100

¹ Values represent averages of duplicate assays (variation is ± 5%).
² Concentration in mg/ml.



Table IV

Kinetic constants for the hydrolysis of quenched fluorogenic substrates by shed-hSKI-1

Assays and data analysis were carried out as described in Table II-A. The
values are averages of duplicate assays.

Peptide     Km(app)     Vmax(app)     Vmax(app)/Km(app)
            (µM)        (µmol/h)      (h⁻¹ L⁻¹)

X           31.3        34.0          1.1
XI          8.7         56.9          6.5
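
To see what the Table IV constants imply at a given substrate concentration, the Michaelis-Menten equation v = Vmax[S]/(Km + [S]) can be evaluated directly. This is a minimal sketch that assumes simple Michaelis-Menten behaviour; the function name and the chosen substrate concentrations are hypothetical.

```python
# Apparent constants from Table IV: (Km(app) in uM, Vmax(app) in umol/h).
TABLE_IV = {"X": (31.3, 34.0), "XI": (8.7, 56.9)}


def mm_rate(s_um, km_um, vmax):
    """Michaelis-Menten rate v = Vmax * [S] / (Km + [S])."""
    return vmax * s_um / (km_um + s_um)


if __name__ == "__main__":
    for s_um in (1.0, 10.0, 100.0):  # hypothetical substrate concentrations
        rates = {name: mm_rate(s_um, km, vmax)
                 for name, (km, vmax) in TABLE_IV.items()}
        print(f"[S] = {s_um:6.1f} uM  X: {rates['X']:.2f}  XI: {rates['XI']:.2f}")
```

At substrate concentrations well below Km the rate approaches (Vmax/Km)[S], which is why the Vmax(app)/Km(app) ratio is the quantity compared between substrates.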



Table V

Effect of pro-segment peptide constructs on BTMD-hSKI-1 activity

Digestion reactions using BTMD-SKI-1 medium plus peptide IV were carried out
as described in "Experimental Procedures". The prosegment peptides were
preincubated with the enzyme for 30 min. Values were deduced from the Dixon plots
obtained from three separate experiments.

Pro-segment construct     Ki(app)
                          (nM)

PS1                       182.0 ± 0.5
PS2                       97.5 ± 4.5
PS3                       127.3 ± 6.2



EXAMPLE 4 

SIMILARITY OF ANATOMICAL DISTRIBUTION OF SKI-1 mRNA TO THAT OF APP 

β-Amyloid precursor protein (β-APP) is a member of a highly conserved gene
family, which includes amyloid precursor-like protein-1 and amyloid precursor-like
protein-2 { McNamara, M.J. et al. (1998) Brain Research 804, 45-51; Rassoulzadegan,
M. et al. (1998) The EMBO Journal 17, 4647-4656 }. Mammalian subtilases,
exemplified by SKI-1, may be responsible for limited cleavage at hydrophobic residues
present in biologically important precursor proteins such as β-amyloid precursor
protein (β-APP) (Table VI). SKI-1 has recently been identified as the enzyme which
cleaves sterol-regulatory element-binding protein (SREBP) in a fashion analogous to
the β-secretase cleavage of APP { Sakai, J. et al. (1998) Molecular Cell 2, 505-514 }.
The cleavage of SREBP by SKI-1 (Site 1 protease) at a position 20 residues to the
lumenal side of the first membrane-spanning segment is analogous to the β-secretase
cleavage of β-APP at a position 28 amino acids from the membrane { Brown, M.S. and
Goldstein, J.L. (1997) Cell 89, 331-340 }.
Similarity of anatomical distribution of SKI-1 mRNA to that of APP suggests a
functional link between both proteins.

In situ hybridization performed in 4-day-old mouse provides evidence of a similar
distribution of mRNA coding for the membrane proteins SKI-1 and APP (Fig. 25).
Their spatial distributions were observed to overlap significantly within different
tissues such as brain and spinal cord, cranial and spinal ganglia, submaxillary gland,
thymus, kidney, bones, skin and many others. Their mRNA distribution was partially
similar to that of two other proteases, namely the convertase furin and the peptidase
neprilysin. A much different distribution was observed with convertases PC1, PC2 and
PC5. It is clearly established that an increase in cellular cholesterol levels results in
the inhibition of activity of SKI-1/S1P { reviewed in Edwards, P.A., and Ericsson, J.
(1999) Annu. Rev. Biochem. 68, 157-185 }. In a similar fashion, an increase in dietary
cholesterol leads to significant reductions in brain levels of secreted APP derivatives,
including sAPPα, sAPPβ, Aβ1-40 and Aβ1-42 { Howland, D.S. et al. (1998) J. Biol.
Chem. 273, 16576-16582 }. The nature of the relationships between cholesterol, SKI-1
and APP metabolism is complex.

Cellular association between SKI-1 and APP in lacrimal gland. Potential use of
shed SKI-1 in tears as diagnostic tool.

Results of immunocytochemistry performed in mouse lacrimal glands provide
evidence for the presence of SKI-1 and APP in the same cell types, including
intralobular duct epithelial cells and some acinar cells (Fig. 26). The finding of SKI-1
in the lacrimal gland suggests the possibility of developing a diagnostic assay
analyzing tears, perhaps based on two-dimensional polyacrylamide gel
electrophoresis for disease diagnosis { Molley, M.P. et al. (1997) Electrophoresis 18,
2811-2815; Glasson, M.J. et al. (1998) Electrophoresis 19, 852-855; Grus, F.H., and
Augustin, A.J. (1999) Electrophoresis 20, 875-880; Iskeleli, G. et al. (1999)
Electrophoresis 20, 875-880 }.
EXAMPLE 5

Prodomains in general (for example furin and PC7 prodomains) function in
trans when expressed in mammalian cells to inhibit their cognate subtilisin-like
convertase

We have recently shown that the prosegment of furin expressed as an
independent domain (preprofurin, ppfurin) can specifically inhibit neurotrophin
processing. In these assays, successful inhibition requires not only that the prodomain
enter the secretory pathway, but that it remain there long enough to interact with the
target PC (most likely furin within the TGN). Figures 27 & 28 depict vaccinia virus
constructs or transient transfections of prosegments preventing the maturation of the
neurotrophins NGF and BDNF in Schwann or COS-1 cells, respectively. The modest
inhibition with the prodomain of PC7 (ppPC7) is most likely due to inhibition of furin,
since PC7 is a poor effector of proNGF and proBDNF maturation in these cells. The
complementary experiment to demonstrate selectivity by the prosegment of PC7 will
be carried out once we are able to establish unique in vivo PC7 substrates.

Most proteases from the four major classes (thiol, aspartic, serine, and metallo)
are synthesized as inactive precursor molecules with N-terminal extensions
(prosegments) that play critical roles in folding, stability and regulation of enzymatic
activity { Khan, A.R., and James, M.N. (1998) Protein Sci. 7, 815-836 }. The proregions
of the PCs have been shown to function as potent inhibitors of their cognate enzymes
in vitro. We present data for the first time showing that the expression of a prosegment
as an independent domain in a cell-based (ex vivo) assay functions as a PC inhibitor
(Figs. 27 and 28). In these assays, successful inhibition requires not only that the
prodomain enter the secretory pathway, but that it remain there long enough to interact
with the target PC (most likely furin within the TGN).

We have shown that expression of full-length SKI-1 prosegment (22-24 kDa,
with sequence ending at the secondary cleavage sequence RHSSRRLL) inhibits SKI-1
activity in stable HK293 cell lines (Example 2). However, since the prodomain of
SKI-1 is processed at an internal primary cleavage site RKVFRSLK to give a 14 kDa
N-terminal fragment (Fig. 29A&B), we predict that mutation of this site will generate
an even more effective SKI-1 inhibitor. In fact, in the case of the mouse PC5
prodomain, we have shown that mutation of the internal prosegment cleavage site does
generate an inhibitor of the processing of the 150 kDa integrin hα4 to 80 kDa and
70 kDa species (Fig. 15).
EXAMPLE 6

SKI-1 Peptide Substrates for fluorescence resonance energy transfer (FRET)
- Based Proteolysis Assays

A large number of synthetic peptides based on potential cleavage sites in the
hSKI-1 prodomain, proBDNF and the loop region of SREBP-2 were synthesized.
These are:

(i) hSKI-1 (156-172)
Trp-Gln-Ser-Ser-Arg-Pro-Leu-Arg-Arg-Ala-Ser-Leu↓Ser-Leu-Gly-Ser-Gly

(ii) hSKI-1 (174-191)
Trp-His-Ala-Thr-Gly-Arg-His-Ser-Ser-Arg-Arg-Leu-Leu↓Arg-Ala-Ile-Pro-Arg

(iii) hSKI-1 (174-188+Leu+Glu)
Trp-His-Ala-Thr-Gly-Arg-His-Ser-Ser-Arg-Arg-Leu-Leu↓Arg-Ala-Leu-Glu

(iv) hSKI-1 (181-188+Glu)
Ser-Ser-Arg-Arg-Leu-Leu↓Arg-Ala-Ile-Glu

(v) hSKI-1 (187-201)
Arg-Ala-Ile-Pro-Arg-Gln-Val-Ala↓Gln-Thr-Leu-Gln-Ala-Asp-Val

(vi) hSKI-1 (128-136)
Pro-Gln-Arg-Lys-Val-Phe-Arg-Ser-Leu

(vii) hSKI-1 (128-142)
Pro-Gln-Arg-Lys-Val-Phe-Arg-Ser-Leu-Lys↓Tyr-Ala-Glu-Ser-Asp

(viii) hProBDNF (50-63)
Lys-Ala-Gly-Ser-Arg-Gly-Leu-Thr↓Ser-Leu-Ala-Asp-Thr-Phe

(ix) SREBP-2 27 mer
Gly-Gly-Ala-His-Asp-Ser-Asp-Gln-His-Pro-His-Ser-Gly-Ser-Gly-Arg-Ser-Val-
Leu↓Ser-Phe-Glu-Ser-Gly-Ser-Gly-Gly

(x) SREBP-2 10 mer
Ser-Gly-Ser-Gly-Arg-Ser-Val-Leu↓Ser-Phe-Glu-Ser

These peptides were examined as possible substrates of SKI-1. Our data indicate
that only the peptides (iii), (iv), (vii), (viii), (ix) and (x) are efficiently cleaved by
the recombinant SKI-1.

NOVEL FLUOROGENIC SUBSTRATE BASED ASSAY OF SKI-1 ACTIVITY: 
Based on the results reported above with various synthetic peptides, we designed
a number of internally quenched fluorogenic substrates of SKI-1. Our main goal
was to develop a rapid and sensitive method for the assay of SKI-1 enzymatic
activity. SKI-1 activity has so far been monitored by following the cleavage of
suitable peptide substrates by HPLC, which is often extremely slow and cumbersome.
The following internally quenched fluorogenic peptides were synthesized and tested as
substrates for SKI-1:

(a) QSKI (132-142):
Abz-Val-Phe-Arg-Ser-Leu-Lys↓Tyr-Ala-Glu-Ser-Asp-Tyr(NO2)-Ala

(b) QSKI (134-142):
Abz-Arg-Ser-Leu-Lys↓Tyr-Ala-Glu-Ser-Asp-Tyr(NO2)-Ala

(c) QSKI (178-188)
Abz-Arg-His-Ser-Ser-Arg-Arg-Leu-Leu↓Arg-Ala-Ile-Tyr(NO2)-Ala

(d) QSKI (181-187+Leu+Glu)
Abz-Ser-Arg-Arg-Leu-Leu↓Arg-Ala-Leu-Glu-Tyr(NO2)-Ala

(e) QBDNF (47-58)
Abz-Asn-Gly-Pro-Lys-Ala-Gly-Ser-Arg-Gly-Leu-Thr↓Ser-Tyr(NO2)-Ala

The main feature of these peptides is the incorporation of two special amino
acids, namely Abz [ortho-aminobenzoic acid, also known as anthranilic acid]
and Tyr(NO2) [3-nitrotyrosine], at the amino (N-) and carboxy (C-) terminal ends
of the peptide chain, respectively. Abz, an electron donor, is a powerful
fluorescent moiety whereas Tyr(NO2), an electron acceptor, acts as a
fluorescence quenching group. All the above peptides exhibit weak fluorescence
background values (at λex = 320 nm and λem = 420 nm). It is expected that upon
cleavage by the proteolytic action of SKI-1, these peptides will release two peptide
fragments, of which the Abz-containing N-terminal part should display a very high
degree of fluorescence. The net result will be an increase in fluorescence intensity
that can be measured very accurately with a fluorimeter. This technique
of measuring enzymatic activity has been applied to a number of enzymes
{ F. Jean, A. Boudreault, A. Basak, N. G. Seidah and C. Lazure, J. Biol. Chem.,
1995, 270, 19225-19231 }.
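
In such an assay the enzymatic activity is read out as the rate of fluorescence increase. The following is a minimal sketch of how an initial rate could be extracted from a progress curve and expressed relative to a control; the time points and fluorescence readings are hypothetical illustration values, not measured data, and the function names are assumptions.

```python
# Initial-rate estimate from a fluorescence progress curve (e.g. read at
# excitation 320 nm / emission 420 nm). All readings are hypothetical.


def initial_rate(times_min, fluorescence):
    """Least-squares slope of fluorescence versus time (units per minute)."""
    n = len(times_min)
    mean_t = sum(times_min) / n
    mean_f = sum(fluorescence) / n
    sxx = sum((t - mean_t) ** 2 for t in times_min)
    sxy = sum((t - mean_t) * (f - mean_f)
              for t, f in zip(times_min, fluorescence))
    return sxy / sxx


def percent_of_control(sample_rate, control_rate):
    """Express a sample rate as a percentage of the control rate."""
    return 100.0 * sample_rate / control_rate


if __name__ == "__main__":
    times = [0.0, 2.0, 4.0, 6.0, 8.0]              # minutes
    control = [100.0, 140.0, 180.0, 220.0, 260.0]  # arbitrary units
    treated = [100.0, 105.0, 110.0, 115.0, 120.0]  # e.g. plus an inhibitor
    r_control = initial_rate(times, control)
    r_treated = initial_rate(times, treated)
    print(f"control rate: {r_control:.1f} units/min")
    print(f"treated: {percent_of_control(r_treated, r_control):.1f}% of control")
```

Only the early, linear part of the progress curve should be used for such a fit, since the signal flattens as substrate is consumed.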

RESULTS 

Our data indicate that among the above quenched fluorogenic peptides,
peptide (a) is the most effective substrate for SKI-1. In fact, the measurement of
kinetic parameters (Vmax/Km) indicated that this peptide is 6-fold more efficient than
the nearest candidate, quenched peptide (b). HPLC analysis using both UV and
fluorescence detector systems clearly revealed a single site of cleavage in
peptides (a) and (b) (as indicated above by a vertical arrow ↓), again reinforcing
the notion that the preferred sequence motif for SKI-1 is characterized by the
presence of an Arg residue at P4, an alkyl hydrophobic residue at P2 and possibly
an aromatic hydrophobic residue at P1'. Therefore, peptide (a) is a highly
specific fluorogenic substrate for monitoring the activity of SKI-1.

This invention has been described in detail hereinabove, and it will be
readily apparent to the skilled artisan that modifications can be made thereto
without departing from the teachings of the present disclosure. These modifications
are considered within the scope of the present invention, as defined in the
appended claims.
Human SKI-1



cagggcacgctgggtcggcggagctgaggctcccagctgtgggcctcgctggcccggtcg 
gtcccgtgcgacccagccgcctcgactccgagggtcgacacccggagcgaccgggccagc 
1 + + + + + + 60 

cccagtctcgcgagagttgggagtaaacagccccgaatggagtgcccaggcgtgttcgcc 
gggtcagagcgctctcaaccctcatttgtcggggcttacctcacgggtccgcacaagcgg 

61 + + + + + + 120 

gcggaggcgccgttatcccgggcccgccggccctgagctcccggcggcgcagattggctc 
cgcctccgcggcaatagggcccgggcggccgggactcgagggccgccgcgtctaaccgag 

121 + + + + + + 180 

acagtggttgattgatcaaccccattggacgttggttctgtggtacaaatggagtacagg 
tgtcaccaactaactagttggggtaacctgcaaccaagacaccatgtttacctcatgtcc

181 + + + + + + 240 

actcagtcgtcacggcctgagtgagagaagccttatttccaagatggagaagaagcggag 
tgagtcagcagtgccggactcactctcttcggaataaaggttctacctcttcttcgcctc 

241 + + + + + + 300 

aaagaaatgaaagcctctcttcaggctgaaccacaaaaggccatgggatttaacttttat 
tttctttactttcggagagaagtccgacttggtgttttccggtaccctaaattgaaaata 

301 + + + + + + 360

ttatgttgggcaagactgtaagatggctgatcagtaatgttgcagcttttagctgaaaca 
aatacaacccgttctgacattctaccgactagtcattacaacgtcgaaaatcgactttgt 

361 + + + + + + 420 

aaaattcacttttaatcaagaagaaaaaagtgtgatttgaatatatgcaattttatgatc 
ttttaagtgaaaattagttcttcttttttcacactaaacttatatacgttaaaatactag 

421 + + + + + + 480 

1 MKLVNIWLLLLVVLL 15

atattcgcttgtgaccatgaagcttgtcaacatctggctgcttctgctcgtggttttgct 
tataagcgaacactggtacttcgaacagttgtagaccgacgaagacgagcaccaaaacga 

481 + + + + + + 540 

16 CGKKHLGDRLEKKSFEKAPC 35 
ctgtgggaagaaacatctgggcgacagactggaaaagaaatcttttgaaaaggccccatg 
gacacccttctttgtagacccgctgtctgaccttttctttagaaaacttttccggggtac 

541 + + + + + + 600 

36 PGCSHLTLKVEFSSTVVEYE 55 
ccctggctgttcccacctgactttgaaggtggaattctcatcaacagttgtggaatatga 
gggaccgacaagggtggactgaaacttccaccttaagagtagttgtcaacaccttatact 

601 + + + + + + 660 

56 YIVAFNGYFTAKARNSFISS 75 
atatattgtggctttcaatggatactttacagccaaagctagaaattcatttatttcaag 
tatataacaccgaaagttacctatgaaatgtcggtttcgatctttaagtaaataaagttc 

661 + + + + + + 720 

76 ALKSSEVDNWRIIPRNNPSS 95 
tgccctgaagagcagtgaagtagacaattggagaattatacctcgaaacaatccatccag 
acgggacttctcgtcacttcatctgttaacctcttaatatggagctttgttaggtaggtc 

721 + + + + + + 780 

96 DYPSDFEVIQIKEKQKAGLL 115 
tgactaccctagtgattttgaggtgattcagataaaagaaaaacagaaagcggggctgct 
actgatgggatcactaaaactccactaagtctattttctttttgtctttcgccccgacga 

781 + + + + + + 840 

116 TLEDHPNIKRVTPQRKVFRS 135
aacacttgaagatcatccaaacatcaaacgggtcacgccccaacgaaaagtctttcgttc 
ttgtgaacttctagtaggtttgtagtttgcccagtgcggggttgcttttcagaaagcaag 

841 + + + + + + 900 

136 LKYAESDPTVPCNETRWSQK 155 
cctcaagtatgctgaatctgaccccacagtaccctgcaatgaaacccggtggagccagaa 
ggagttcatacgacttagactggggtgtcatgggacgttactttgggccacctcggtctt 

901 + + + + + + 960 

156 WQSSRPLRRASLSLGSGFWH 175 
gtggcaatcatcacgtcccctgcgaagagccagcctctccctgggctctggcttctggca 
caccgttagtagtgcaggggacgcttctcggtcggagagggacccgagaccgaagaccgt 

961 + + + + + + 1020 
176 ATGRHSSRRLLRAIPRQVAQ 195
tgctacgggaaggcattcgagcagacggctgctgagagccatcccgcgccaggttgccca 
acgatgcccttccgtaagctcgtctgccgacgactctcggtagggcgcggtccaacgggt 

1021 + + + + + + 1080 

196 TLQADVLWQMGYTGANVRVA 215 
gacactgcaggcagatgtgctctggcagatgggatatacaggtgctaatgtaagagttgc 
ctgtgacgtccgtctacacgagaccgtctaccctatatgtccacgattacattctcaacg 

1081 + + + + + + 1140 

216 VFDTGLSEKHPHFKNVKERT 235 
tgtttttgacactgggctgagcgagaagcatccccacttcaaaaatgtgaaggagagaac 
acaaaaactgtgacccgactcgctcttcgtaggggtgaagtttttacacttcctctcttg 

1141 + + + + + + 1200 

236 NWTNERTLDDGLGHGTFVAG 255
caactggaccaacgagcgaacgctggacgatgggttgggccatggcacattcgtggcagg 
gttgacctggttgctcgcttgcgacctgctacccaacccggtaccgtgtaagcaccgtcc 

1201 + + + + + + 1260

256 VIASMRECQGFAPDAELHIF 275 
tgtgatagccagcatgagggagtgccaaggatttgctccagatgcagaacttcacatttt 
acactatcggtcgtactccctcacggttcctaaacgaggtctacgtcttgaagtgtaaaa 

1261 + + + + + + 1320 

276 RVFTNNQVSYTSWFLDAFNY 295
cagggtctttaccaataatcaggtatcttacacatcttggtttttggacgccttcaacta 
gtcccagaaatggttattagtccatagaatgtgtagaaccaaaaacctgcggaagttgat 

1321 + + + + + + 1380 

296 AILKKIDVLNLSIGGPDFMD 315
tgccattttaaagaagatcgacgtgttaaacctcagcatcggcggcccggacttcatgga 
acggtaaaatttcttctagctgcacaatttggagtcgtagccgccgggcctgaagtacct 

1381 + + + + + + 1440 

316 HPFVDKVWELTANNVIMVSA 335 
tcatccgtttgttgacaaggtgtgggaattaacagctaacaatgtaatcatggtttctgc 
agtaggcaaacaactgttccacacccttaattgtcgattgttacattagtaccaaagacg 

1441 + + + + + + 1500 

336 IGNDGPLYGTLNNPADQMDV 355 
tattggcaatgacggacctctttatggcactctgaataaccctgctgatcaaatggatgt 
ataaccgttactgcctggagaaataccgtgagacttattgggacgactagtttacctaca 

1501 + + + + + + 1560 

356 IGVGGIDFEDNIARFSSRGM 375 
gattggagtaggcggcattgactttgaagataacatcgcccgcttttcttcaaggggaat 
ctaacctcatccgccgtaactgaaacttctattgtagcgggcgaaaagaagttcccctta 

1561 + + + + + + 1620

376 TTWELPGGYGRMKPDIVTYG 395 
gactacctgggagctaccaggaggctacggtcgcatgaaacctgacattgtcacctatgg 
ctgatggaccctcgatggtcctccgatgccagcgtactttggactgtaacagtggatacc 

1621 + + + + + + 1680 

396 AGVRGSGVKGGCRALSGTSV 415
tgctggcgtgcggggttctggcgtgaaaggggggtgccgggccctctcagggaccagtgt 
acgaccgcacgccccaagaccgcactttccccccacggcccgggagagtccctggtcaca 

1681 + + + + + + 1740 

416 ASPVVAGAVTLLVSTVQKRE 435 
tgcttctccagtggttgcaggtgctgtcaccttgttagtgagcacagtccagaagcgtga 
acgaagaggtcaccaacgtccacgacagtggaacaatcactcgtgtcaggtcttcgcact 

1741 + + + + + + 1800 

436 LVNPASMKQALIASARRLPG 455 
gctggtgaatcccgccagtatgaagcaggccctgatcgcgtcagcccggaggctccccgg 
cgaccacttagggcggtcatacttcgtccgggactagcgcagtcgggcctccgaggggcc 

1801 + + + + + + 1860

456 VNMFEQGHGKLDLLRAYQIL 475 
ggtcaacatgtttgagcaaggccacggcaagctcgatctgctcagagcctatcagatcct 
ccagttgtacaaactcgttccggtgccgttcgagctagacgagtctcggatagtctagga 
1861 + + + + + + 1920 
476 NSYKPQASLSPSYIDLTECP 495
caacagctacaagccacaggcaagtttgagccccagctacatagatctgactgagtgtcc 
gttgtcgatgttcggtgtccgttcaaactcggggtcgatgtatctagactgactcacagg 

1921 + + + + + + 1980 

496 YMWPYCSQPIYYGGMPTVVN 515
ctacatgtggccctactgctcccagcccatctactatggaggaatgccgacagttgttaa 
gatgtacaccgggatgacgagggtcgggtagatgatacctccttacggctgtcaacaatt 
1981 + + + + + + 2040

516 VTILNGMGVTGRIVDKPDWQ 535 
tgtcaccatcctcaacggcatgggagtcacaggaagaattgtagataagcctgactggca 
acagtggtaggagttgccgtaccctcagtgtccttcttaacatctattcggactgaccgt 

2041 + + + + + + 2100 

536 PYLPQNGDNIEVAFSYSSVL 555 
gccctatttgccacagaacggagacaacattgaagttgccttctcctactcctcggtctt 
cgggataaacggtgtcttgcctctgttgtaacttcaacggaagaggatgaggagccagaa 

2101 + + + + + + 2160 

556 WPWSGYLAISISVTKKAASW 575 
atggccttggtcgggctacctggccatctccatttctgtgaccaagaaagcggcttcctg 
taccggaaccagcccgatggaccggtagaggtaaagacactggttctttcgccgaaggac 

2161 + + + + + + 2220

576 EGIAQGHVMITVASPAETES 595 
ggaaggcattgctcagggccatgtcatgatcactgtggcttccccagcagagacagagtc 
ccttccgtaacgagtcccggtacagtactagtgacaccgaaggggtcgtctctgtctcag 

2221 + + + + + + 2280 

596 KNGAEQTSTVKLPIKVKIIP 615 
aaaaaatggtgcagaacagacttcaacagtaaagctccccattaaggtgaagataattcc 
ttttttaccacgtcttgtctgaagttgtcatttcgaggggtaattccacttctattaagg 

2281 + + + + + + 2340 

616 TPPRSKRVLWDQYHNLRYPP 635 
tactcccccgcgaagcaagagagttctctgggatcagtaccacaacctccgctatccacc 
atgagggggcgcttcgttctctcaagagaccctagtcatggtgttggaggcgataggtgg 

2341 + + + + + + 2400 

636 GYFPRDNLRMKNDPLDWNGD 655
tggctatttccccagggataatttaaggatgaagaatgaccctttagactggaatggtga 
accgataaaggggtccctattaaattcctacttcttactgggaaatctgaccttaccact 
2401 + + + + + + 2460

656 HIHTNFRDMYQHLRSMGYFV 675
tcacatccacaccaatttcagggatatgtaccagcatctgagaagcatgggctactttgt 
agtgtaggtgtggttaaagtccctatacatggtcgtagactcttcgtacccgatgaaaca 

2461 + + + + + + 2520

676 EVLGAPFTCFDASQYGTLLM 695 
agaggtcctcggggcccccttcacgtgttttgatgccagtcagtatggcactttgctgat 
tctccaggagccccgggggaagtgcacaaaactacggtcagtcataccgtgaaacgacta 

2521 + + + + + + 2580 

696 VDSEEEYFPEEIAKLRRDVD 715 
ggtggacagtgaggaggagtacttccctgaagagatcgccaagctccggagggacgtgga 
ccacctgtcactcctcctcatgaagggacttctctagcggttcgaggcctccctgcacct 

2581 + + + + + + 2640 

716 NGLSLVIFSDWYNTSVMRKV 735 
caacggcctctcgctcgtcatcttcagtgactggtacaacacttctgttatgagaaaagt 
gttgccggagagcgagcagtagaagtcactgaccatgttgtgaagacaatactcttttca 

2641 + + + + + + 2700 

736 KFYDENTRQWWMPDTGGANI 755 
gaagttttatgatgaaaacacaaggcagtggtggatgccggataccggaggagctaacat 
cttcaaaatactacttttgtgttccgtcaccacctacggcctatggcctcctcgattgta 

2701 + + + + + + 2760 

756 PALNELLSVWNMGFSDGLYE 775 
cccagctctgaatgagctgctgtctgtgtggaacatggggttcagcgatggcctgtatga 
gggtcgagacttactcgacgacagacacaccttgtaccccaagtcgctaccggacatact 
2761 + + + + + + 2820 
776 GEFTLANHDMYYASGCSIAK 795
aggggagttcaccctggccaaccatgacatgtattatgcgtcagggtgcagcatcgcgaa 
tcccctcaagtgggaccggttggtactgtacataatacgcagtcccacgtcgtagcgctt 

2821 + + + + + + 2880 

796 FPEDGVVITQTFKDQGLEVL 815 
gtttccagaagatggcgtcgtgataacacagactttcaaggaccaaggattggaggtttt 
caaaggtcttctaccgcagcactattgtgtctgaaagttcctggttcctaacctccaaaa 

2881 + + + + + + 2940 

816 KQETAVVENVPILGLYQIPA 835 
aaagcaggaaacagcagttgttgaaaacgtccccattttgggactttatcagattccagc 
tttcgtcctttgtcgtcaacaacttttgcaggggtaaaaccctgaaatagtctaaggtcg 
2941 + + + + + + 3000

836 EGGGRIVLYGDSNCLDDSHR 855 
tgagggtggaggccggattgtactgtatggggactccaattgcttggatgacagtcaccg 
actcccacctccggcctaacatgacatacccctgaggttaacgaacctactgtcagtggc 

3001 + + + + + + 3060 

856 QKDCFWLLDALLQYTSYGVT 875
acagaaggactgcttttggcttctggatgccctcctccagtacacatcgtatggggtgac 
tgtcttcctgacgaaaaccgaagacctacgggaggaggtcatgtgtagcataccccactg 

3061 + + + + + + 3120 

876 PPSLSHSGNRQRPPSGAGSV 895 
accgcctagcctcagtcactctgggaaccgccagcgccctcccagtggagcaggctcagt 
tggcggatcggagtcagtgagacccttggcggtcgcgggagggtcacctcgtccgagtca 

3121 + + + + + + 3180 

896 TPERMEGNHLHRYSKVLEAH 915 
cactccagagaggatggaaggaaaccatcttcatcggtactccaaggttctggaggccca 
gtgaggtctctcctaccttcctttggtagaagtagccatgaggttccaagacctccgggt 
3181 + + + + + + 3240

916 LGDPKPRPLPACPRLSWAKP 935 
tttgggagacccaaaacctcggcctctaccagcctgtccacgcttgtcttgggccaagcc 
aaaccctctgggttttggagccggagatggtcggacaggtgcgaacagaacccggttcgg 
3241 + + + + + + 3300

936 QPLNETAPSNLWKHQKLLSI 955 
acagcctttaaacgagacggcgcccagtaacctttggaaacatcagaagctactctccat 
tgtcggaaatttgctctgccgcgggtcattggaaacctttgtagtcttcgatgagaggta 
3301 + + + + + + 3360 

956 DLDKVVLPNFRSNRPQVRPL 975 
tgacctggacaaggtggtgttacccaactttcgatcgaatcgccctcaagtgaggccctt 
actggacctgttccaccacaatgggttgaaagctagcttagcgggagttcactccgggaa 

3361 + + + + + + 3420

976 SPGESGAWDIPGGIMPGRYN 995 
gtcccctggagagagcggcgcctgggacattcctggagggatcatgcctggccgctacaa 
caggggacctctctcgccgcggaccctgtaaggacctccctagtacggaccggcgatgtt 

3421 + + + + + + 3480 

996 QEVGQTIPVFAFLGAMVVLA 1015 
ccaggaggtgggccagaccattcctgtctttgccttcctgggagccatggtggtcctggc 
ggtcctccacccggtctggtaaggacagaaacggaaggaccctcggtaccaccaggaccg 

3481 + + + + + + 3540 

1016 FFVVQINKAKSRPKRRKPRV 1035
cttctttgtggtacaaatcaacaaggccaagagcaggccgaagcggaggaagcccagggt 
gaagaaacaccatgtttagttgttccggttctcgtccggcttcgcctccttcgggtccca 

3541 + + + + + + 3600 

1036 KRPQLMQQVHPPKTPSV* 1053 
gaagcgcccgcagctcatgcagcaggttcacccgccaaagaccccttcggtgtgaccggc 
cttcgcgggcgtcgagtacgtcgtccaagtgggcggtttctggggaagccacactggccg 

3601 + + + + + + 3660 

agcctggctgaccgtgagggccagagagagccttcacggacggcgctggtgggtgagccg 
tcggaccgactggcactcccggtctctctcggaagtgcctgccgcgaccacccactcggc 

3661 + + + + + + 3720 

agctgtggtggcggctggtttaaaagggatccagtttccagctgcaggtttgttagagtc 
tcgacaccaccgccgaccaaattttccctaggtcaaaggtcgacgtccaaacaatctcag 

3721 + + + + + + 3780 
tgttctacatgggcctgccctcctgtgatgggcagaggctcctggtacatcgagaagatt
acaagatgtacccggacgggaggacactacccgtctccgaggaccatgtagctcttctaa 

3781 + + + + + + 3840 

cctgtggatcccgtcaggagggacttagtggctctgccgccagtgagacttcccgccggc 
ggacacctagggcagtcctccctgaatcaccgagacggcggtcactctgaagggcggccg 

3841 + + + + + + 3900 

agctgtgcgcaccaaagactcgggagaactggaaaggctgtctggggtcttctgactgca 
tcgacacgcgtggtttctgagccctcttgacctttccgacagaccccagaagactgacgt 

3901 + + + + + + 3960

ggggaaggatgtactttccaaacaaatgatacaaccctgaccaagctaaaagacgcttgt 
ccccttcctacatgaaaggtttgtttactatgttgggactggttcgattttctgcgaaca 

3961 + + + + + + 4020

taaaggctattttctatatttattgttgggaaaagtcactttaaagacttgtgctatttg 
atttccgataaaagatataaataacaacccttttcagtgaaatttctgaacacgataaac 

4021 + + + + + + 4080

gaagcaaagctattttttttgtcagtggaatgcagtttttttactattccatcatgagga 
cttcgtttcgataaaaaaaacagtcaccttacgtcaaaaaaatgataaggtagtactcct 

4081 + + + + + + 4140 

acaacatagattccatgatctttttaatgacagtacagactgagatttgaaggaaacatg 
tgttgtatctaaggtactagaaaaattactgtcatgtctgactctaaacttcctttgtac 

4141 + + + + + + 4200 

cacaaatctgtaaaacatagaccttcgctttatttttgtaagtatcacctgccaccatgt 
gtgtttagacattttgtatctggaagcgaaataaaaacattcatagtggacggtggtaca 

4201 + + + + + + 4260

tttgtaatttgaggtcttgatttcaccattgtcggtgaagaaaattttcaataaatatgt 
aaacattaaactccagaactaaagtggtaacagccacttcttttaaaagttatttataca 

4261 + + + + + + 4320 

attacccgtctgaagctt 
taatgggcagacttcgaa 

4321 + 4338 
Rat SKI-1

1 GCGAGTAAACATCCCCCGAATGGATACCCGAGGCGTGTTCGCGGCGGAGGCCCCGTTTTC 60
CGCTCATTTGTAGGGGGCTTACCTATGGGCTCCGCACAAGCGCCGCCTCCGGGGCAAAAG

61 CCGGGTCCGCCGATCCCGAGCCTGAGGCGACGCAGATCGGCTCAGAGCGGTGGCTTGGGC 120
GGCCCAGGCGGCTAGGGCTCGGACTCCGCTGCGTCTAGCCGAGTCTCGCCACCGAACCCG

121 TCCTGCTAGATTTGGGTCTGTGGTACAAATGGAGTTTAGGACTCAGTGGACTCGGCCCTA 180
AGGACGATCTAAACCCAGACACCATGTTTACCTCAAATCCTGAGTCACCTGAGCCGGGAT

181 ATGAGAGAAGCCCCCTGTCCAAGATGGAGAAGAAGCGGAGAAAGAAATGAAAGCCTCTTT 240
TACTCTCTTCGGGGGACAGGTTCTACCTCTTCTTCGCCTCTTTCTTTACTTTCGGAGAAA



241 TTGGGCCAAGCTGTGGGTGACCATGGGACTGAGGTTTTCTTTACGTTGGACAAGTCTGTA 300 
AACCCGGTTCGACACCCACTGGTACCCTGACTCCAAAAGAAATGCAACCTGTTCAGACAT 

301 GGATGGCTGATCAGTAAGGTTGCAGCTTTTAGCGAAAACAGAAATCCACTTCTGATCAAG 360 
CCTACCGACTAGTCATTCCAACGTCGAAAATCGCTTTTGTCTTTAGGTGAAGACTAGTTC 



361 GAAGAGCCTAGTGCAATTTGAATTTATGCAATTTTATGACCATATTCACTTAGGACCATG 420
CTTCTCGGATCACGTTAAACTTAAATACGTTAAAATACTGGTATAAGTGAATCCTGGTAC 

2 KLVNIWLLLLVVLLCGKKHL 21
421 AAGCTCGTCAACATCTGGCTTCTTCTGCTGGTGGTTTTGCTCTGTGGGAAAAAGCATCTG 480
TTCGAGCAGTTGTAGACCGAAGAAGACGACCACCAAAACGAGACACCCTTTTTCGTAGAC 



22 GDRLGKKAFEKAPCPSCSHL 41
481 GGTGACAGGCTGGGGAAGAAAGCTTTTGAAAAGGCCCCATGCCCCAGCTGTTCCCACCTG 540
CCACTGTCCGACCCCTTCTTTCGAAAACTTTTCCGGGGTACGGGGTCGACAAGGGTGGAC



541 ACTTTGAAGGTGGAATTCTCCTCAACTGTGGTGGAATATGAATATATTGTGGCTTTCAAC 600 
TGAAACTTCCACCTTAAGAGGAGTTGACACCACCTTATACTTATATAACACCGAAAGTTG 



601 GGATACTTCACAGCCAAAGCTAGAAACTCATTTATTTCAAGTGCTCTAAAAAGCAGTGAA 660 
CCTATGAAGTGTCGGTTTCGATCTTTGAGTAAATAAAGTTCACGAGATTTTTCGTCACTT 

82 VDNWRIIPRNNPSSDYPSDF 101
661 GTGGACAACTGGAGAATAATACCTCGGAACAACCCATCTAGTGACTACCCTAGTGATTTT 720 
CACCTGTTGACCTCTTATTATGGAGCCTTGTTGGGTAGATCACTGATGGGATCACTAAAA 

102 EVIQIKEKQKAGLLTLEDHP 121 
721 GAGGTGATTCAGATAAAAGAGAAGCAGAAGGCGGGGCTGCTCACACTTGAAGATCACCCA 780 
CTCCACTAAGTCTATTTTCTCTTCGTCTTCCGCCCCGACGAGTGTGAACTTCTAGTGGGT 
122 NIKRVTPQRKVFRSLKFAES 141
781 AACATCAAGCGGGTGACACCCCAGCGGAAAGTCTTTCGTTCCCTGAAGTTTGCTGAATCC 840 
TTGTAGTTCGCCCACTGTGGGGTCGCCTTTCAGAAAGCAAGGGACTTCAAACGACTTAGG 



142 DPIVPCNETRWSQKWQSSRP 161 
841 GACCCCATTGTGCCCTGTAATGAGACCCGGTGGAGCCAGAAGTGGCAGTCATCACGTCCC 900 
CTGGGGTAACACGGGACATTACTCTGGGCCACCTCGGTCTTCACCGTCAGTAGTGCAGGG 



162 LKRASLSLGSGFWHATGRHS 181 
901 CTGAAAAGAGCCAGTCTCTCCCTGGGCTCTGGATTCTGGCATGCAACAGGAAGGCATTCA 960 
GACTTTTCTCGGTCAGAGAGGGACCCGAGACCTAAGACCGTACGTTGTCCTTCCGTAAGT 



182 SRRLLRAIPRQVAQTLQADV 201
961 AGTCGACGATTGCTGAGAGCCATTCCTCGCCAGGTTGCCCAGACATTGCAGGCAGATGTG 1020 
TCAGCTGCTAACGACTCTCGGTAAGGAGCGGTCCAACGGGTCTGTAACGTCCGTCTACAC 



202 LWQMGYTGANVRVAVFDTGL 221 
1021 CTTTGGCAGATGGGATACACAGGTGCTAATGTCAGGGTTGCCGTTTTTGATACTGGGCTC 1080 
GAAACCGTCTACCCTATGTGTCCACGATTACAGTCCCAACGGCAAAAACTATGACCCGAG 



222 SEKHPHFKNVKERTNWTNER 241
1081 AGTGAGAAGCATCCACATTTCAAGAATGTGAAGGAAAGAACCAACTGGACCAATGAGCGG 1140
TCACTCTTCGTAGGTGTAAAGTTCTTACACTTCCTTTCTTGGTTGACCTGGTTACTCGCC 



242 TLDDGLGHGTFVAGVIASMR 261 
1141 ACCCTGGACGATGGGCTGGGCCATGGCACATTCGTTGCAGGTGTGATTGCCAGCATGAGA 1200 
TGGGACCTGCTACCCGACCCGGTACCGTGTAAGCAACGTCCACACTAACGGTCGTACTCT 



262 ECQGFAPDAELHIFRVFTNN 281
1201 GAGTGCCAAGGATTTGCCCCAGATGCAGAGCTGCACATCTTCAGGGTCTTTACCAACAAT 1260 
CTCACGGTTCCTAAACGGGGTCTACGTCTCGACGTGTAGAAGTCCCAGAAATGGTTGTTA 



282 QVSYTSWFLDAFNYAILKKM 301 
1261 CAGGTGTCTTACACGTCTTGGTTTTTGGATGCCTTCAACTATGCCATCCTAAAGAAGATG 1320 
GTCCACAGAATGTGCAGAACCAAAAACCTACGGAAGTTGATACGGTAGGATTTCTTCTAC 



302 DVLNLSIGGPDFMDHPFVDK 321 
1321 GACGTTCTGAACCTTAGCATCGGTGGGCCTGACTTCATGGATCACCCCTTTGTTGACAAG 1380 
CTGCAAGACTTGGAATCGTAGCCACCCGGACTGAAGTACCTAGTGGGGAAACAACTGTTC 



322 VWELTANNVIMVSAIGNDGP 341 
1381 GTATGGGAATTAACAGCGAACAATGTAATCATGGTTTCTGCTATTGGCAATGATGGACCT 1440
CATACCCTTAATTGTCGCTTGTTACATTAGTACCAAAGACGATAACCGTTACTACCTGGA 



342 LYGTLNNPADQMDVIGVGGI 361 
1441 CTCTATGGCACTCTGAATAACCCTGCTGATCAGATGGATGTGATTGGAGTGGGTGGCATT 1500 
GAGATACCGTGAGACTTATTGGGACGACTAGTCTACCTACACTAACCTCACCCACCGTAA 



362 DFEDNIARFSSRGMTTWELP 381 
1501 GACTTTGAAGACAACATCGCCCGCTTCTCTTCCAGGGGAATGACTACCTGGGAACTACCG 1560 
CTGAAACTTCTGTTGTAGCGGGCGAAGAGAAGGTCCCCTTACTGATGGACCCTTGATGGC 
382 GGYGRVKPDIVTYGAGVRGS 401
1561 GGAGGCTATGGTCGTGTGAAGCCTGACATTGTCACCTATGGTGCTGGAGTGCGGGGTTCT 1620 
CCTCCGATACCAGCACACTTCGGACTGTAACAGTGGATACCACGACCTCACGCCCCAAGA 



402 GVKGGCRALSGTSVASPVVA 421 
1621 GGTGTGAAAGGGGGCTGCCGTGCACTCTCAGGGACCAGTGTCGCCTCCCCAGTGGTTGCT 1680
CCACACTTTCCCCCGACGGCACGTGAGAGTCCCTGGTCACAGCGGAGGGGTCACCAACGA 



422 GAVTLLVSTVQKRELVNPAS 441
1681 GGGGCTGTCACCTTGTTAGTAAGCACAGTACAGAAGCGGGAGCTAGTGAATCCTGCCAGT 1740
CCCCGACAGTGGAACAATCATTCGTGTCATGTCTTCGCCCTCGATCACTTAGGACGGTCA 



442 VKQALIASARRLPGVNMFEQ 461 
1741 GTGAAGCAAGCTTTGATAGCATCAGCCCGGAGACTTCCTGGTGTCAACATGTTTGAGCAA 1800 
CACTTCGTTCGAAACTATCGTAGTCGGGCCTCTGAAGGACCACAGTTGTACAAACTCGTT 



462 GHGKLDLLRAYQILSSYKPQ 481 
1801 GGCCATGGCAAGTTGGATCTACTGCGAGCCTATCAGATCCTCAGCAGCTATAAACCGCAG 1860 
CCGGTACCGTTCAACCTAGATGACGCTCGGATAGTCTAGGAGTCGTCGATATTTGGCGTC 



482 ASLSPSYIDLTECPYMWPYC 501 
1861 GCGAGCCTGAGTCCTAGCTACATCGACCTGACTGAGTGTCCCTACATGTGGCCCTACTGC 1920
CGCTCGGACTCAGGATCGATGTAGCTGGACTGACTCACAGGGATGTACACCGGGATGACG 



502 SQPIYYGGMPTIVNVTILNG 521 
1921 TCCCAGCCCATCTACTATGGAGGAATGCCAACAATTGTTAATGTCACCATCCTCAATGGC 1980
AGGGTCGGGTAGATGATACCTCCTTACGGTTGTTAACAATTACAGTGGTAGGAGTTACCG 



522 MGVTGRIVDKPEWRPYLPQN 541 
1981 ATGGGAGTTACAGGAAGAATTGTGGATAAGCCTGAGTGGCGACCCTATTTACCACAGAAT 2040 
TACCCTCAATGTCCTTCTTAACACCTATTCGGACTCACCGCTGGGATAAATGGTGTCTTA 



542 GDNIEVAFSYSSVLWPWSGY 561 
2041 GGAGACAACATTGAAGTGGCCTTCTCCTACTCCTCAGTGTTGTGGCCTTGGTCAGGTTAC 2100 
CCTCTGTTGTAACTTCACCGGAAGAGGATGAGGAGTCACAACACCGGAACCAGTCCAATG 



562 LAISISVTKKAASWEGIAQG 581
2101 CTTGCCATCTCCATTTCTGTGACCAAGAAGGCAGCTTCCTGGGAAGGCATCGCGCAGGGC 2160 
GAACGGTAGAGGTAAAGACACTGGTTCTTCCGTCGAAGGACCCTTCCGTAGCGCGTCCCG 



582 HIMITVASPAETELKNGAEH 601
2161 CACATCATGATCACAGTGGCTTCCCCAGCAGAGACGGAATTAAAAAATGGTGCCGAGCAT 2220 
GTGTAGTACTAGTGTCACCGAAGGGGTCGTCTCTGCCTTAATTTTTTACCACGGCTCGTA 



602 TSTVKLPIKVKIIPTPPRSK 621 
2221 ACTTCCACAGTGAAGCTGCCCATCAAGGTGAAGATCATTCCCACCCCTCCTCGGAGCAAG 2280 
TGAAGGTGTCACTTCGACGGGTAGTTCCACTTCTAGTAAGGGTGGGGAGGAGCCTCGTTC 
622 RVLWDQYHNLRYPPGYFPRD 641
2281 AGAGTCCTCTGGGACCAGTACCACAACCTCCGCTACCCACCCGGCTACTTCCCCAGGGAC 2340
TCTCAGGAGACCCTGGTCATGGTGTTGGAGGCGATGGGTGGGCCGATGAAGGGGTCCCTG 



642 NLRMKNDPLDWNGDHVHTNF 661 
2341 AACTTGCGGATGAAGAATGATCCTTTAGACTGGAATGGCGACCACGTCCACACCAACTTC 2400
TTGAACGCCTACTTCTTACTAGGAAATCTGACCTTACCGCTGGTGCAGGTGTGGTTGAAG 



662 RDMYQHLRSMGYFVEVLGAP 681 
2401 AGGGACATGTACCAGCATCTGCGCAGCATGGGCTACTTTGTGGAGGTGCTTGGTGCCCCA 2460
TCCCTGTACATGGTCGTAGACGCGTCGTACCCGATGAAACACCTCCACGAACCACGGGGT 



682 FTCFDATQYGTLLMVDSEEE 701 
2461 TTCACATGCTTTGACGCCACGCAGTACGGCACTCTGCTTATGGTGGACAGTGAGGAAGAG 2520
AAGTGTACGAAACTGCGGTGCGTCATGCCGTGAGACGAATACCACCTGTCACTCCTTCTC 



702 YFPEEIAKLRRDVDNGLSLV 721 
2521 TACTTCCCTGAGGAGATTGCTAAGCTGAGGAGGGACGTGGACAATGGCCTTTCCCTTGTC 2580 
ATGAAGGGACTCCTCTAACGATTCGACTCCTCCCTGCACCTGTTACCGGAAAGGGAACAG 



722 VFSDWYNTSVMRKVKFYDEN 741 
2581 GTCTTCAGTGACTGGTACAACACTTCTGTTATGAGAAAAGTGAAGTTTTACGATGAAAAC 2640 
CAGAAGTCACTGACCATGTTGTGAAGACAATACTCTTTTCACTTCAAAATGCTACTTTTG 



742 TRQWWMPDTGGANVPALNEL 761 
2641 ACAAGGCAGTGGTGGATGCCAGATACTGGAGGAGCCAACGTCCCAGCTCTAAACGAGCTG 2700
TGTTCCGTCACCACCTACGGTCTATGACCTCCTCGGTTGCAGGGTCGAGATTTGCTCGAC 



762 LSVWNMGFSDGLYEGEFALA 781
2701 CTGTCTGTGTGGAACATGGGGTTCAGTGACGGCCTGTATGAAGGGGAGTTTGCCCTGGCA 2760 
GACAGACACACCTTGTACCCCAAGTCACTGCCGGACATACTTCCCCTCAAACGGGACCGT 



782 NHDMYYASGCSIARFPEDGV 801 
2761 AACCACGACATGTACTATGCATCGGGGTGCAGCATTGCCAGGTTTCCAGAAGATGGTGTG 2820
TTGGTGCTGTACATGATACGTAGCCCCACGTCGTAACGGTCCAAAGGTCTTCTACCACAC 



802 VITQTFKDQGLEVLKQETAV 821
2821 GTGATCACACAGACTTTCAAGGACCAAGGATTGGAAGTCTTAAAACAAGAGACAGCAGTT 2880 
CACTAGTGTGTCTGAAAGTTCCTGGTTCCTAACCTTCAGAATTTTGTTCTCTGTCGTCAA 



822 VDNVPILGLYQIPAEGGGRI 841
2881 GTCGACAATGTCCCCATTCTGGGGCTATATCAGATTCCAGCTGAAGGTGGAGGCCGGATT 2940 
CAGCTGTTACAGGGGTAAGACCCCGATATAGTCTAAGGTCGACTTCCACCTCCGGCCTAA 



842 VLYGDSNCLDDSHRQKDCFW 861 
2941 GTGCTGTATGGAGACTCCAACTGCTTGGATGACAGTCACAGACAGAAGGACTGCTTTTGG 3000 
CACGACATACCTCTGAGGTTGACGAACCTACTGTCAGTGTCTGTCTTCCTGACGAAAACC 
862 LLDALLQYTSYGVTPPSLSH 881
3001 CTTCTGGATGCACTCCTTCAGTACACATCCTATGGTGTGACCCCTCCCAGCCTCAGCCAT 3060 
GAAGACCTACGTGAGGAAGTCATGTGTAGGATACCACACTGGGGAGGGTCGGAGTCGGTA 



882 SGNRQRPPSGAGLAPPERME 901 
3061 TCAGGGAACCGGCAGCGCCCACCCAGCGGGGCTGGCTTGGCCCCTCCTGAAAGGATGGAA 3120 
AGTCCCTTGGCCGTCGCGGGTGGGTCGCCCCGACCGAACCGGGGAGGACTTTCCTACCTT 



902 GNHLHRYSKVLEAHLGDPKP 921 
3121 GGAAACCACCTTCATCGCTACTCCAAAGTTCTTGAGGCCCACTTGGGAGACCCGAAACCT 3180 
CCTTTGGTGGAAGTAGCGATGAGGTTTCAAGAACTCCGGGTGAACCCTCTGGGCTTTGGA 



922 RPLPACPHLSWAKPQPLNET 941 
3181 CGGCCCCTTCCAGCCTGTCCACACTTGTCGTGGGCCAAGCCACAGCCTTTGAATGAGACG 3240 
GCCGGGGAAGGTCGGACAGGTGTGAACAGCACCCGGTTCGGTGTCGGAAACTTACTCTGC 



942 APSNLWKHQKLLSIDLDKVV 961
3241 GCACCCAGTAATCTTTGGAAACACCAGAAGCTGCTCTCCATTGACCTGGACAAAGTAGTG 3300 
CGTGGGTCATTAGAAACCTTTGTGGTCTTCGACGAGAGGTAACTGGACCTGTTTCATCAC 



962 LPNFRSNRPQVRPLSPGESG 981 
3301 TTACCCAACTTTCGCTCAAATCGCCCTCAAGTGAGACCTTTGTCCCCTGGAGAAAGTGGT 3360 
AATGGGTTGAAAGCGAGTTTAGCGGGAGTTCACTCTGGAAACAGGGGACCTCTTTCACCA 



982 AWDIPGGIMPGRYNQEVGQT 1001 
3361 GCCTGGGACATTCCTGGAGGGATCATGCCTGGCCGCTACAACCAGGAAGTAGGCCAGACC 3420 
CGGACCCTGTAAGGACCTCCCTAGTACGGACCGGCGATGTTGGTCCTTCATCCGGTCTGG 



1002 IPVFAFLGAMVALAFFVVQI 1021
3421 ATCCCTGTTTTTGCCTTCCTTGGAGCCATGGTGGCCCTGGCCTTCTTCGTGGTACAGATC 3480
TAGGGACAAAAACGGAAGGAACCTCGGTACCACCGGGACCGGAAGAAGCACCATGTCTAG 



1022 SKAKSRPKRRRPRAKRPQLA 1041
3481 AGTAAGGCCAAGAGCCGGCCGAAGCGGAGGAGGCCCAGGGCAAAGCGTCCACAACTTGCA 3540
TCATTCCGGTTCTCGGCCGGCTTCGCCTCCTCCGGGTCCCGTTTCGCAGGTGTTGAACGT 



1042 QQAHPARTPSV 1052 
3541 CAGCAGGCCCACCCTGCAAGGACCCCGTCAGTGTGATCATCACAGTGGCCAGACACAGAA 3600 
GTCGTCCGGGTGGGACGTTCCTGGGGCAGTCACACTAGTAGTGTCACCGGTCTGTGTCTT 



3601 GCTGACAAGCTTTGAACCCCTCTGGTGGCCACACAGCATCAGAGAGCATCCTGGGAAGTG 3660 
CGACTGTTCGAAACTTGGGGAGACCACCGGTGTGTCGTAGTCTCTCGTAGGACCCTTCAC 



3661 CCTGTTTCCAAGGAGCCCTATCTCTGGATTGTGGCTGGCTTAGTGTGTTCTGCCCAGACG 3720 
GGACAAAGGTTCCTCGGGATAGAGACCTAACACCGACCGAATCACACAAGACGGGTCTGC 



3721 TCTATGAGGTACATCCTGCAGTGCCTCACTGTGTTTGGCTCTGGCCGAAGGTGCCCAGTA 3780
AGATACTCCATGTAGGACGTCACGGAGTGACACAAACCGAGACCGGCTTCCACGGGTCAT 
CCGACAGCCCTGTATGAAAGATGTATTACGATGTTGGGACTGGTTCGCTTCTGTA
Mouse SKI-1



1 MKLVSTWLLVLVVLLCGK 18

GCATTCCATGAAGCTCGTCAGCACCTGGCTTCTTGTGCTGGTGGTTTTGCTCTGTGGGAA 
1 CGTAAGGTACTTCGAGCAGTCGTGGACCGAAGAACACGACCACCAAAACGAGACACCCTT 60 



19 RHLGDRLGTRALEKAPCPSC 38 

ACGGCACCTGGGCGACAGGCTGGGGACGAGAGCTTTGGAAAAGGCCCCGTGCCCCAGCTG 
61 TGCCGTGGACCCGCTGTCCGACCCCTGCTCTCGAAACCTTTTCCGGGGCACGGGGTCGAC 120 



39 SHLTLKVEFSSTVVEYEYIV 58
CTCCCACCTGACTTTGAAGGTGGAATTCTCTTCAACTGTGGTGGAGTACGAATATATTGT 
121 GAGGGTGGACTGAAACTTCCACCTTAAGAGAAGTTGACACCACCTCATGCTTATATAACA 180 



59 AFNGYFTAKARNSFISSALK 78
GGCTTTCAACGGATACTTCACAGCCAAAGCTAGAAACTCATTTATTTCAAGTGCGCTGAA
181 CCGAAAGTTGCCTATGAAGTGTCGGTTTCGATCTTTGAGTAAATAAAGTTCACGCGACTT 240



79 SSEVENWRIIPRNNPSSDYP 98 
AAGCAGTGAAGTGGAAAACTGGAGAATAATACCTCGGAACAACCCATCCAGTGACTACCC 
241 TTCGTCACTTCACCTTTTGACCTCTTATTATGGAGCCTTGTTGGGTAGGTCACTGATGGG 300 



99 SDFEVIQIKEKQKAGLLTLE 118
TAGTGATTTTGAGGTGATTCAGATAAAAGAGAAGCAGAAGGCGGGGCTGCTCACACTTGA 
301 ATCACTAAAACTCCACTAAGTCTATTTTCTCTTCGTCTTCCGCCCCGACGAGTGTGAACT 360 



119 DHPNIKRVTPQRKVFRSLKF 138 

AGATCACCCCAACATCAAGCGGGTGACACCCCAGCGGAAAGTCTTTCGTTCCCTCAAGTT 
361 TCTAGTGGGGTTGTAGTTCGCCCACTGTGGGGTCGCCTTTCAGAAAGCAAGGGAGTTCAA 420



139 AESNPIVPCNETRWSQKWQS 158 

TGCTGAATCCAACCCCATCGTGCCCTGTAATGAAACCCGGTGGAGCCAGAAGTGGCAGTC 
421 ACGACTTAGGTTGGGGTAGCACGGGACATTACTTTGGGCCACCTCGGTCTTCACCGTCAG 480



159 SRPLKRASLSLGSGFWHATG 178 

ATCACGTCCCCTGAAAAGAGCCAGTCTCTCCCTGGGCTCTGGATTCTGGCATGCAACAGG 
481 TAGTGCAGGGGACTTTTCTCGGTCAGAGAGGGACCCGAGACCTAAGACCGTACGTTGTCC 540



179 RHSSRRLLRAIPRQVAQTLQ 198 

AAGACATTCAAGTCGGCGATTGCTGAGAGCCATTCCTCGCCAGGTCGCCCAGACACTGCA 
541 TTCTGTAAGTTCAGCCGCTAACGACTCTCGGTAAGGAGCGGTCCAGCGGGTCTGTGACGT 600 



199 ADVLWQMGYTGANVRVAVFD 218 

GGCAGATGTGCTGTGGCAGATGGGATACACAGGTGCTAATGTCAGAGTTGCTGTTTTTGA 
601 CCGTCTACACGACACCGTCTACCCTATGTGTCCACGATTACAGTCTCAACGACAAAAACT 660 



219 TGLSEKHPHFKNVKERTNWT 238 

TACTGGGCTCAGTGAGAAGCATCCGCATTTTAAGAATGTGAAGGAGAGAACCAACTGGAC 
661 ATGACCCGAGTCACTCTTCGTAGGCGTAAAATTCTTACACTTCCTCTCTTGGTTGACCTG 720 
239 NERTLDDGLGHGTFVAGVIA 258

CAATGAGCGGACCCTGGATGATGGGCTAGGCCATGGCACATTCGTTGCAGGTGTGATTGC 
721 GTTACTCGCCTGGGACCTACTACCCGATCCGGTACCGTGTAAGCAACGTCCACACTAACG 780 



259 SMRECQGFAPDAELHIFRVF 278

CAGCATGAGGGAGTGCCAAGGATTTGCTCCAGATGCAGAGCTGCACATCTTCAGGGTCTT
781 GTCGTACTCCCTCACGGTTCCTAAACGAGGTCTACGTCTCGACGTGTAGAAGTCCCAGAA 840



279 TNNQVSYTSWFLDAFNYAIL 298 

TACCAACAATCAGGTGTCTTACACATCTTGGTTTCTGGATGCCTTCAACTATGCCATCCT 
841 ATGGTTGTTAGTCCACAGAATGTGTAGAACCAAAGACCTACGGAAGTTGATACGGTAGGA 900 



299 KKMDVLNLSIGGPDFMDHPF 318

AAAGAAGATGGACGTTCTCAACCTTAGCATCGGTGGGCCCGACTTCATGGATCATCCGTT 
901 TTTCTTCTACCTGCAAGAGTTGGAATCGTAGCCACCCGGGCTGAAGTACCTAGTAGGCAA 960 



319 VDKVWELTANNVIMVSAIGN 338 

TGTTGACAAGGTGTGGGAATTAACAGCTAACAATGTAATTATGGTTTCTGCTATTGGCAA 
961 ACAACTGTTCCACACCCTTAATTGTCGATTGTTACATTAATACCAAAGACGATAACCGTT 1020 



339 DGPLYGTLNNPADQMDVIGV 358 
TGATGGACCTCTCTATGGCACTCTGAATAACCCTGCTGATCAGATGGATGTGATTGGAGT 
1021 ACTACCTGGAGAGATACCGTGAGACTTATTGGGACGACTAGTCTACCTACACTAACCTCA 1080 



359 GGIDFEDNIARFSSRGMTTW 378
GGGTGGCATTGACTTTGAAGATAACATCGCTCGCTTTTCTTCCAGGGGAATGACTACCTG
1081 CCCACCGTAACTGAAACTTCTATTGTAGCGAGCGAAAAGAAGGTCCCCTTACTGATGGAC 1140



379 ELPGGYGRVKPDIVTYGAGV 398 
GGAATTACCAGGAGGCTATGGTCGTGTGAAGCCTGACATTGTCACCTATGGTGCTGGAGT 
1141 CCTTAATGGTCCTCCGATACCAGCACACTTCGGACTGTAACAGTGGATACCACGACCTCA 1200 



399 RGSGVKGGCRALSGTSVASP 418
GCGGGGTTCCGGTGTGAAAGGGGGCTGCCGTGCACTCTCAGGGACCAGTGTCGCTTCCCC 
1201 CGCCCCAAGGCCACACTTTCCCCCGACGGCACGTGAGAGTCCCTGGTCACAGCGAAGGGG 1260 



419 VVAGAVTLLVSTVQKRELVN 438
AGTGGTCGCTGGGGCCGTCACCTTGTTAGTAAGCACAGTACAGAAGCGGGAGCTGGTGAA 
1261 TCACCAGCGACCCCGGCAGTGGAACAATCATTCGTGTCATGTCTTCGCCCTCGACCACTT 1320 



439 PASVKQALIASARRLPGVNM 458 
TCCTGCCAGTGTGAAGCAAGCTTTGATAGCGTCAGCCCGGAGACTTCCTGGGGTCAACAT 
1321 AGGACGGTCACACTTCGTTCGAAACTATCGCAGTCGGGCCTCTGAAGGACCCCAGTTGTA 1380



459 FEQGHGKLDLLRAYQILSSY 478 
GTTCGAGCAAGGTCATGGCAAGTTGGATCTGCTGCGAGCTTATCAGATCCTCAGCAGCTA 
1381 CAAGCTCGTTCCAGTACCGTTCAACCTAGACGACGCTCGAATAGTCTAGGAGTCGTCGAT 1440 
479 KPQASLSPSYIDLTECPYMW 498
TAAACCGCAGGCAAGCCTGAGTCCTAGCTACATCGACCTGACTGAGTGTCCCTACATGTG
1441 ATTTGGCGTCCGTTCGGACTCAGGATCGATGTAGCTGGACTGACTCACAGGGATGTACAC 1500



499 PYCSQPIYYGGMPTIVNVTI 518 
GCCCTACTGCTCCCAGCCTATCTACTATGGAGGAATGCCAACAATCGTTAATGTCACCAT 
1501 CGGGATGACGAGGGTCGGATAGATGATACCTCCTTACGGTTGTTAGCAATTACAGTGGTA 1560 



519 LNGMGVTGRIVDKPEWRPYL 538 
CCTCAATGGCATGGGCGTCACAGGAAGAATTGTGGATAAGCCTGAGTGGCGACCCTATTT 
1561 GGAGTTACCGTACCCGCAGTGTCCTTCTTAACACCTATTCGGACTCACCGCTGGGATAAA 1620 



539 PQNGDNIEVAFSYSSVLWPW 558 
ACCACAGAATGGAGACAACATTGAAGTGGCCTTCTCCTACTCCTCAGTGTTGTGGCCCTG 
1621 TGGTGTCTTACCTCTGTTGTAACTTCACCGGAAGAGGATGAGGAGTCACAACACCGGGAC 1680



559 SGYLAISISVTKKAASWEGI 578
GTCAGGTTACCTTGCCATCTCCATTTCTGTGACCAAGAAGGCAGCTTCCTGGGAAGGCAT 
1681 CAGTCCAATGGAACGGTAGAGGTAAAGACACTGGTTCTTCCGTCGAAGGACCCTTCCGTA 1740 



579 AQGHIMITVASPAETELHSG 598 
CGCTCAGGGCCACATCATGATCACAGTGGCGTCCCCAGCAGAGACAGAGTTACACAGTGG 
1741 GCGAGTCCCGGTGTAGTACTAGTGTCACCGCAGGGGTCGTCTCTGTCTCAATGTGTCACC 1800 



599 AEHTSTVKLPIKVKIIPTPP 618 
TGCGGAGCACACTTCCACCGTGAAGCTGCCCATCAAGGTGAAGATCATTCCCACCCCTCC 
1801 ACGCCTCGTGTGAAGGTGGCACTTCGACGGGTAGTTCCACTTCTAGTAAGGGTGGGGAGG 1860 



619 RSKRVLWDQYHNLRYPPGYF 638 
TCGGAGCAAGAGAGTCCTCTGGGACCAGTACCACAACCTCCGCTACCCACCTGGCTACTT 
1861 AGCCTCGTTCTCTCAGGAGACCCTGGTCATGGTGTTGGAGGCGATGGGTGGACCGATGAA 1920 



639 PRDNLRMKNDPLDWNGDHVH 658 
CCCCAGGGACAACTTGCGGATGAAGAATGACCCTTTAGACTGGAATGGCGACCACGTCCA 
1921 GGGGTCCCTGTTGAACGCCTACTTCTTACTGGGAAATCTGACCTTACCGCTGGTGCAGGT 1980



659 TNFRDMYQHLRSMGYFVEVL 678 
CACCAACTTCAGGGACATGTACCAGCATCTGCGCAGCATGGGCTACTTCGTGGAGGTGCT 
1981 GTGGTTGAAGTCCCTGTACATGGTCGTAGACGCGTCGTACCCGATGAAGCACCTCCACGA 2040 



679 GAPFTCFDATQYGTLLLVDS 698
CGGCGCCCCATTCACATGTTTTGACGCCACACAGTATGGCACTTTGCTGCTGGTGGACAG 
2041 GCCGCGGGGTAAGTGTACAAAACTGCGGTGTGTCATACCGTGAAACGACGACCACCTGTC 2100 



699 EEEYFPEEIAKLRRDVDNGL 718 
TGAGGAAGAGTACTTCCCTGAGGAGATTGCTAAGCTGAGGAGGGATGTGGACAATGGCCT 
2101 ACTCCTTCTCATGAAGGGACTCCTCTAACGATTCGACTCCTCCCTACACCTGTTACCGGA 2160 



719 SLVIFSDWYNTSVMRKVKFY 738 
TTCCCTCGTCATCTTCAGTGACTGGTACAACACTTCTGTTATGAGAAAAGTGAAGTTTTA 
2161 AAGGGAGCAGTAGAAGTCACTGACCATGTTGTGAAGACAATACTCTTTTCACTTCAAAAT 2220 
739 DENTRQWWMPDTGGANIPAL 758
TGATGAAAACACCAGGCAGTGGTGGATGCCAGACACCGGAGGAGCGAACATCCCAGCTCT 
2221 ACTACTTTTGTGGTCCGTCACCACCTACGGTCTGTGGCCTCCTCGCTTGTAGGGTCGAGA 2280 



759 NELLSVWNMGFSDGLYEGEF 778 
GAATGAGCTGCTGTCTGTGTGGAACATGGGGTTCAGTGACGGCCTATATGAAGGGGAGTT 
2281 CTTACTCGACGACAGACACACCTTGTACCCCAAGTCACTGCCGGATATACTTCCCCTCAA 2340



779 VLANHDMYYASGCSIAKFPE 798 
TGTCCTGGCAAACCATGACATGTACTATGCGTCGGGGTGCAGCATCGCCAAGTTTCCAGA 
2341 ACAGGACCGTTTGGTACTGTACATGATACGCAGCCCCACGTCGTAGCGGTTCAAAGGTCT 2400 



799 DGVVITQTFKDQGLEVLKQE 818 
AGATGGCGTCGTGATCACACAGACTTTCAAGGACCAAGGATTGGAGGTCTTAAAACAAGA 
2401 TCTACCGCAGCACTAGTGTGTCTGAAAGTTCCTGGTTCCTAACCTCCAGAATTTTGTTCT 2460



819 TAVVENVPILGLYQIPSEGG 838 
GACAGCAGTTGTGGAAAATGTTCCCATTTTGGGGCTTTATCAGATTCCATCTGAAGGTGG 
2461 CTGTCGTCAACACCTTTTACAAGGGTAAAACCCCGAAATAGTCTAAGGTAGACTTCCACC 2520



839 GRIVLYGDSNCLDDSHRQKD 858 
AGGCCGGATCGTGCTGTATGGAGACTCCAACTGCTTGGATGACAGTCACAGACAGAAGGA 
2521 TCCGGCCTAGCACGACATACCTCTGAGGTTGACGAACCTACTGTCAGTGTCTGTCTTCCT 2580 



859 CFWLLDALLQYTSYGVTPPS 878 
CTGCTTTTGGCTTCTGGATGCGCTCCTTCAGTACACATCCTATGGCGTGACCCCTCCCAG 
2581 GACGAAAACCGAAGACCTACGCGAGGAAGTCATGTGTAGGATACCGCACTGGGGAGGGTC 2640



879 LSHSGNRQRPPSGAGLAPPE 898 
CCTCAGCCATTCAGGGAACCGGCAGCGCCCACCTAGCGGAGCCGGCTTGGCCCCTCCTGA 
2641 GGAGTCGGTAAGTCCCTTGGCCGTCGCGGGTGGATCGCCTCGGCCGAACCGGGGAGGACT 2700 



899 RMEGNHLHRYSKVLEAHLGD 918 
AAGGATGGAAGGAAACCACCTCCATCGGTACTCCAAAGTTCTTGAAGCCCACTTGGGAGA 
2701 TTCCTACCTTCCTTTGGTGGAGGTAGCCATGAGGTTTCAAGAACTTCGGGTGAACCCTCT 2760 



919 PKPRPLPACPHLSWAKPQPL 938 
CCCGAAACCTCGGCCCCTGCCAGCCTGTCCACATTTGTCATGGGCCAAGCCACAGCCTTT 
2761 GGGCTTTGGAGCCGGGGACGGTCGGACAGGTGTAAACAGTACCCGGTTCGGTGTCGGAAA 2820



939 NETAPSNLWKHQKLLSIDLD 958
GAATGAGACGGCACCCAGTAATCTTTGGAAACATCAGAAGCTGCTCTCCATTGACCTGGA 
2821 CTTACTCTGCCGTGGGTCATTAGAAACCTTTGTAGTCTTCGACGAGAGGTAACTGGACCT 2880 



959 KVVLPNFRSNRPQVRPLSPG 978 
CAAAGTAGTGTTACCCAACTTTCGATCCAATCGCCCTCAAGTGAGACCTTTGTCCCCTGG 
2881 GTTTCATCACAATGGGTTGAAAGCTAGGTTAGCGGGAGTTCACTCTGGAAACAGGGGACC 2940
979 ESGAWDIPGGIMPGRYNQEV 998
AGAGAGTGGTGCCTGGGACATTCCTGGAGGGATCATGCCTGGCCGCTACAACCAGGAGGT 
2941 TCTCTCACCACGGACCCTGTAAGGACCTCCCTAGTACGGACCGGCGATGTTGGTCCTCCA 3000 



999 GQTIPVFAFLGAMVALAFFV 1018 
GGGACAGACCATCCCCGTCTTCGCCTTCCTCGGAGCCATGGTGGCCCTGGCCTTCTTTGT 
3001 CCCTGTCTGGTAGGGGCAGAAGCGGAAGGAGCCTCGGTACCACCGGGACCGGAAGAAACA 3060 



1019 VQISKAKSRPKRRRPRAKRP 1038 

GGTACAGATCAGCAAGGCCAAGAGCCGGCCGAAGCGGAGGAGGCCCAGGGCAAAGCGTCC 
3061 CCATGTCTAGTCGTTCCGGTTCTCGGCCGGCTTCGCCTCCTCCGGGTCCCGTTTCGCAGG 3120 



1039 QLAQQAHPARTPSV 1052 

ACAACTTGCACAGCAGGCCCACCCTGCAAGGACCCCATCAGTGTGAGCATCGCAGTAGCC 
3121 TGTTGAACGTGTCGTCCGGGTGGGACGTTCCTGGGGTAGTCACACTCGTAGCGTCATCGG 3180 



AGCCACAGAAGCTAACAAGCCTTGAACCACTCTGGTGGCCACACAGCGCCTCAGAGAGCA 
3181 TCGGTGTCTTCGATTGTTCGGAACTTGGTGAGACCACCGGTGTGTCGCGGAGTCTCTCGT 3240 



TTCTGGGAAGTGCCTGTTTCCGAGGACCCTGTCTCCAGCTTGTGGCTATCTTACTGTGTT 
3241 AAGACCCTTCACGGACAAAGGCTCCTGGGACAGAGGTCGAACACCGATAGAATGACACAA 3300 



CTGCCCAGGCACCTGATGAGGTACATCCTGCAGTGCCTCTCTGTGCTTGGCTCTGGCAGA 
3301 GACGGGTCCGTGGACTACTCCATGTAGGACGTCACGGAGAGACACGAACCGAGACCGTCT 3360 



AGGCACCCAGTGACATCAGGCATCAGGCCCAGTGACAGTGCACCAAAGACACAGAGCCTG 
3361 TCCGTGGGTCACTGTAGTCCGTAGTCCGGGTCACTGTCACGTGGTTTCTGTGTCTCGGAC 3420 



GAAGGGCTGTCGGGACATACTTTCTACATAACGCTACAACCCTGACCAAGCAAAGACATG 
3421 CTTCCCGACAGCCCTGTATGAAAGATGTATTGCGATGTTGGGACTGGTTCGTTTCTGTAC 3480



CTTGTTACAGGCTATTTTCTATATTTATTGTGGGAGAGTCACTTTAAAGACTGTGCTAGT 
3481 GAACAATGTCCGATAAAAGATATAAATAACACCCTCTCAGTGAAATTTCTGACACGATCA 3540 



TGGAAACAGAGCTGTTGCTGTTGTCAGTCGAGTGCAGTTTTCTGCAGCGATGTCATAAGG 
3541 ACCTTTGTCTCGACAACGACAACAGTCAGCTCACGTCAAAAGACGTCGCTACAGTATTCC 3600 



AGTCAGATTCCGTGACCTCCTCTTTGATGGAGGACACACTGAACTGAAGGGGACTTGCGC 
3601 TCAGTCTAAGGCACTGGAGGAGAAACTACCTCCTGTGTGACTTGACTTCCCCTGAACGCG 3660 



GGATGTGGGAGATGCAAGCCTTCGCTTTATTTTTTTATAACTATCAACTGCCATCATGTT 
3661 CCTACACCCTCTACGTTCGGAAGCGAAATAAAAAAATATTGATAGTTGACGGTAGTACAA 3720 



TTGTAATTTGGGGATCTTGATTTCACCGTTGTTGGTGAAGGAAATTTTCAATAAATATGC 
3721 AACATTAAACCCCTAGAACTAAAGTGGCAACAACCACTTCCTTTAAAAGTTATTTATACG 3780 



ATAACCTT 
3781 TATTGGAA 3788 
What is claimed is:

1. A soluble proteic fragment of a subtilisin-kexin isoenzyme named SKI-1 which has
the amino acid sequence defined by amino acids 187 to 996 of any one of SEQ ID NOs:
2, 4 and 6, and a variant thereof, which is enzymatically active.

2. A proteic fragment of a subtilisin-kexin isoenzyme named SKI-1, which has the
amino acid sequence defined by amino acids 18 to 137 of any one of SEQ ID NOs: 2, 4
and 6, and a variant thereof, which is capable of binding with amino acids 18 to 1052 of
SKI-1 in whole or in part.

3. The proteic fragment of claim 2, wherein said part has a molecular weight of about 
14 kDa and forms a tight complex with the soluble fragment of SKI-1 as defined in 
claim 1. 

4. The proteic fragment of claim 2, which is an inhibitor of SKI-1 activity. 

5. The proteic fragment of claim 4, wherein the SKI-1 amino acid sequence is 
modified to prevent further enzymatic processing in a cell expressing said proteic 
fragment. 

6. The proteic fragment of claim 5, which is modified by amino acid substitution, 
deletion or rearrangement. 

7. An isolated nucleic acid encoding a protein fragment as defined in claim 1. 

8. An isolated nucleic acid encoding a proteic fragment as defined in claim 2. 

9. An isolated nucleic acid encoding a proteic fragment as defined in claim 3. 

10. An isolated nucleic acid encoding a proteic fragment as defined in any one of 
claims 4 to 6. 

11. A recombinant vector comprising the nucleic acid defined in any one of claims 7 
to 10. 

12. The recombinant vector of claim 11, which is an expression vector. 
<110> Institut de Recherches Cliniques de Montreal 
SEIDAH, Nabil 
CHRETIEN, Michel 
MARCINKIEWICZ, Mieczyslaw 
LAAKSONEN, Reijo 
DAVIGNON, Jean 



<120> MAMMALIAN SUBTILISIN/KEXIN ISOZYME SKI-1: A PROPROTEIN 
CONVERTASE WITH A UNIQUE CLEAVAGE SPECIFICITY 



<130> IRCM 



<140> PCT/CA99/01058 
<141> 1999-11-04 



<150> CA 2,249,648 
<151> 1998-11-04 



<160> 76 



<170> PatentIn Ver. 2.1 



<210> 1 
<211> 3895 
<212> DNA 
<213> Rattus sp. 

<220> 
<221> CDS 

<222> (418)..(3573) 



<400> 1 

gcgagtaaac atcccccgaa tggatacccg aggcgtgttc gcggcggagg ccccgttttc 60 
ccgggtccgc cgatcccgag cctgaggcga cgcagatcgg ctcagagcgg tggcttgggc 120 
tcctgctaga tttgggtctg tggtacaaat ggagtttagg actcagtgga ctcggcccta 180 
atgagagaag ccccctgtcc aagatggaga agaagcggag aaagaaatga aagcctcttt 240 
ttgggccaag ctgtgggtga ccatgggact gaggttttct ttacgttgga caagtctgta 300 
ggatggctga tcagtaaggt tgcagctttt agcgaaaaca gaaatccact tctgatcaag 360 
gaagagccta gtgcaatttg aatttatgca attttatgac catattcact taggacc 417 



atg aag ctc gtc aac atc tgg ctt ctt ctg ctg gtg gtt ttg ctc tgt 
Met Lys Leu Val Asn Ile Trp Leu Leu Leu Leu Val Val Leu Leu Cys 
1 5 10 15 

ggg aaa aag cat ctg ggt gac agg ctg ggg aag aaa gct ttt gaa aag 
Gly Lys Lys His Leu Gly Asp Arg Leu Gly Lys Lys Ala Phe Glu Lys 
20 25 30 

gcc cca tgc ccc agc tgt tcc cac ctg act ttg aag gtg gaa ttc tcc 
Ala Pro Cys Pro Ser Cys Ser His Leu Thr Leu Lys Val Glu Phe Ser 

tca act gtg gtg gaa tat gaa tat att gtg gct ttc aac gga tac ttc 
Ser Thr Val Val Glu Tyr Glu Tyr Ile Val Ala Phe Asn Gly Tyr Phe 

aca gcc aaa gct aga aac tca ttt att tca agt gct cta aaa agc agt 
Thr Ala Lys Ala Arg Asn Ser Phe Ile Ser Ser Ala Leu Lys Ser Ser 

gaa gtg gac aac tgg aga ata ata cct cgg aac aac cca tct agt gac 
Glu Val Asp Asn Trp Arg Ile Ile Pro Arg Asn Asn Pro Ser Ser Asp 

tac cct agt gat ttt gag gtg att cag ata aaa gag aag cag aag gcg 
Tyr Pro Ser Asp Phe Glu Val Ile Gln Ile Lys Glu Lys Gln Lys Ala 
100 105 110 

ggg ctg ctc aca ctt gaa gat cac cca aac atc aag cgg gtg aca ccc 
Gly Leu Leu Thr Leu Glu Asp His Pro Asn Ile Lys Arg Val Thr Pro 
115 120 125 

cag cgg aaa gtc ttt cgt tcc ctg aag ttt gct gaa tcc gac ccc att 
Gln Arg Lys Val Phe Arg Ser Leu Lys Phe Ala Glu Ser Asp Pro Ile 
130 135 140 

gtg ccc tgt aat gag acc cgg tgg agc cag aag tgg cag tca tca cgt 
Val Pro Cys Asn Glu Thr Arg Trp Ser Gln Lys Trp Gln Ser Ser Arg 
145 150 155 160 

ccc ctg aaa aga gcc agt ctc tcc ctg ggc tct gga ttc tgg cat gca 
Pro Leu Lys Arg Ala Ser Leu Ser Leu Gly Ser Gly Phe Trp His Ala 
165 170 175 

aca gga agg cat tca agt cga cga ttg ctg aga gcc att cct cgc cag 
Thr Gly Arg His Ser Ser Arg Arg Leu Leu Arg Ala Ile Pro Arg Gln 
180 185 190 



gtt gcc cag aca ttg cag gca gat gtg ctt tgg cag atg gga tac aca 1041 
Val Ala Gln Thr Leu Gln Ala Asp Val Leu Trp Gln Met Gly Tyr Thr 
195 200 205 

ggt gct aat gtc agg gtt gcc gtt ttt gat act ggg ctc agt gag aag 1089 
Gly Ala Asn Val Arg Val Ala Val Phe Asp Thr Gly Leu Ser Glu Lys 
210 215 220 

cat cca cat ttc aag aat gtg aag gaa aga acc aac tgg acc aat gag 1137 
His Pro His Phe Lys Asn Val Lys Glu Arg Thr Asn Trp Thr Asn Glu 
225 230 235 240 

cgg acc ctg gac gat ggg ctg ggc cat ggc aca ttc gtt gca ggt gtg 1185 
Arg Thr Leu Asp Asp Gly Leu Gly His Gly Thr Phe Val Ala Gly Val 
245 250 255 

att gcc agc atg aga gag tgc caa gga ttt gcc cca gat gca gag ctg 1233 
Ile Ala Ser Met Arg Glu Cys Gln Gly Phe Ala Pro Asp Ala Glu Leu 
260 265 270 

cac atc ttc agg gtc ttt acc aac aat cag gtg tct tac acg tct tgg 1281 
His Ile Phe Arg Val Phe Thr Asn Asn Gln Val Ser Tyr Thr Ser Trp 
275 280 285 

ttt ttg gat gcc ttc aac tat gcc atc cta aag aag atg gac gtt ctg 1329 
Phe Leu Asp Ala Phe Asn Tyr Ala Ile Leu Lys Lys Met Asp Val Leu 
290 295 300 

aac ctt agc atc ggt ggg cct gac ttc atg gat cac ccc ttt gtt gac 1377 
Asn Leu Ser Ile Gly Gly Pro Asp Phe Met Asp His Pro Phe Val Asp 
305 310 315 320 

aag gta tgg gaa tta aca gcg aac aat gta atc atg gtt tct gct att 1425 
Lys Val Trp Glu Leu Thr Ala Asn Asn Val Ile Met Val Ser Ala Ile 
325 330 335 

ggc aat gat gga cct ctc tat ggc act ctg aat aac cct gct gat cag 1473 
Gly Asn Asp Gly Pro Leu Tyr Gly Thr Leu Asn Asn Pro Ala Asp Gln 
340 345 350 

atg gat gtg att gga gtg ggt ggc att gac ttt gaa gac aac atc gcc 1521 
Met Asp Val Ile Gly Val Gly Gly Ile Asp Phe Glu Asp Asn Ile Ala 
355 360 365 

cgc ttc tct tcc agg gga atg act acc tgg gaa cta ccg gga ggc tat 1569 
Arg Phe Ser Ser Arg Gly Met Thr Thr Trp Glu Leu Pro Gly Gly Tyr 
370 375 380 
ggt cgt gtg aag cct gac att gtc acc tat ggt gct gga gtg cgg ggt 
Gly Arg Val Lys Pro Asp Ile Val Thr Tyr Gly Ala Gly Val Arg Gly 
385 390 395 400 

tct ggt gtg aaa ggg ggc tgc cgt gca ctc tca ggg acc agt gtc gcc 
Ser Gly Val Lys Gly Gly Cys Arg Ala Leu Ser Gly Thr Ser Val Ala 
405 410 415 

tcc cca gtg gtt gct ggg gct gtc acc ttg tta gta agc aca gta cag 
Ser Pro Val Val Ala Gly Ala Val Thr Leu Leu Val Ser Thr Val Gln 

aag cgg gag cta gtg aat cct gcc agt gtg aag caa gct ttg ata gca 
Lys Arg Glu Leu Val Asn Pro Ala Ser Val Lys Gln Ala Leu Ile Ala 
435 440 445 

tca gcc cgg aga ctt cct ggt gtc aac atg ttt gag caa ggc cat ggc 
Ser Ala Arg Arg Leu Pro Gly Val Asn Met Phe Glu Gln Gly His Gly 
450 455 460 

aag ttg gat cta ctg cga gcc tat cag atc ctc agc agc tat aaa ccg 
Lys Leu Asp Leu Leu Arg Ala Tyr Gln Ile Leu Ser Ser Tyr Lys Pro 
465 470 475 480 

cag gcg agc ctg agt cct agc tac atc gac ctg act gag tgt ccc tac 
Gln Ala Ser Leu Ser Pro Ser Tyr Ile Asp Leu Thr Glu Cys Pro Tyr 
485 490 495 

atg tgg ccc tac tgc tcc cag ccc atc tac tat gga gga atg cca aca 
Met Trp Pro Tyr Cys Ser Gln Pro Ile Tyr Tyr Gly Gly Met Pro Thr 
500 505 510 

att gtt aat gtc acc atc ctc aat ggc atg gga gtt aca gga aga att 
Ile Val Asn Val Thr Ile Leu Asn Gly Met Gly Val Thr Gly Arg Ile 
515 520 525 

gtg gat aag cct gag tgg cga ccc tat tta cca cag aat gga gac aac 
Val Asp Lys Pro Glu Trp Arg Pro Tyr Leu Pro Gln Asn Gly Asp Asn 
530 535 540 

att gaa gtg gcc ttc tcc tac tcc tca gtg ttg tgg cct tgg tca ggt 
Ile Glu Val Ala Phe Ser Tyr Ser Ser Val Leu Trp Pro Trp Ser Gly 
545 550 555 560 

tac ctt gcc atc tcc att tct gtg acc aag aag gca gct tcc tgg gaa 
Tyr Leu Ala Ile Ser Ile Ser Val Thr Lys Lys Ala Ala Ser Trp Glu 
565 570 575 

ggc atc gcg cag ggc cac atc atg atc aca gtg gct tcc cca gca gag 
Gly Ile Ala Gln Gly His Ile Met Ile Thr Val Ala Ser Pro Ala Glu 
580 585 590 

acg gaa tta aaa aat ggt gcc gag cat act tcc aca gtg aag ctg ccc 
Thr Glu Leu Lys Asn Gly Ala Glu His Thr Ser Thr Val Lys Leu Pro 
595 600 605 

atc aag gtg aag atc att ccc acc cct cct cgg agc aag aga gtc ctc 
Ile Lys Val Lys Ile Ile Pro Thr Pro Pro Arg Ser Lys Arg Val Leu 
610 615 620 

tgg gac cag tac cac aac ctc cgc tac cca ccc ggc tac ttc ccc agg 
Trp Asp Gln Tyr His Asn Leu Arg Tyr Pro Pro Gly Tyr Phe Pro Arg 
625 630 635 640 

gac aac ttg cgg atg aag aat gat cct tta gac tgg aat ggc gac cac 
Asp Asn Leu Arg Met Lys Asn Asp Pro Leu Asp Trp Asn Gly Asp His 
645 650 655 

gtc cac acc aac ttc agg gac atg tac cag cat ctg cgc agc atg ggc 
Val His Thr Asn Phe Arg Asp Met Tyr Gln His Leu Arg Ser Met Gly 
660 665 670 

tac ttt gtg gag gtg ctt ggt gcc cca ttc aca tgc ttt gac gcc acg 
Tyr Phe Val Glu Val Leu Gly Ala Pro Phe Thr Cys Phe Asp Ala Thr 
675 680 685 

cag tac ggc act ctg ctt atg gtg gac agt gag gaa gag tac ttc cct 
Gln Tyr Gly Thr Leu Leu Met Val Asp Ser Glu Glu Glu Tyr Phe Pro 
690 695 700 

gag gag att gct aag ctg agg agg gac gtg gac aat ggc ctt tcc ctt 
Glu Glu Ile Ala Lys Leu Arg Arg Asp Val Asp Asn Gly Leu Ser Leu 
705 710 715 720 

gtc gtc ttc agt gac tgg tac aac act tct gtt atg aga aaa gtg aag 
Val Val Phe Ser Asp Trp Tyr Asn Thr Ser Val Met Arg Lys Val Lys 
725 730 735 

ttt tac gat gaa aac aca agg cag tgg tgg atg cca gat act gga gga 
Phe Tyr Asp Glu Asn Thr Arg Gln Trp Trp Met Pro Asp Thr Gly Gly 
740 745 750 

gcc aac gtc cca gct cta aac gag ctg ctg tct gtg tgg aac atg ggg 
Ala Asn Val Pro Ala Leu Asn Glu Leu Leu Ser Val Trp Asn Met Gly 
755 760 765 
ttc agt gac ggc ctg tat gaa ggg gag ttt gcc ctg gca aac cac gac 
Phe Ser Asp Gly Leu Tyr Glu Gly Glu Phe Ala Leu Ala Asn His Asp 
770 775 780 

atg tac tat gca tcg ggg tgc agc att gcc agg ttt cca gaa gat ggt 
Met Tyr Tyr Ala Ser Gly Cys Ser Ile Ala Arg Phe Pro Glu Asp Gly 
785 790 795 800 

gtg gtg atc aca cag act ttc aag gac caa gga ttg gaa gtc tta aaa 
Val Val Ile Thr Gln Thr Phe Lys Asp Gln Gly Leu Glu Val Leu Lys 
805 810 815 

caa gag aca gca gtt gtc gac aat gtc ccc att ctg ggg cta tat cag 
Gln Glu Thr Ala Val Val Asp Asn Val Pro Ile Leu Gly Leu Tyr Gln 
820 825 830 

att cca gct gaa ggt gga ggc cgg att gtg ctg tat gga gac tcc aac 
Ile Pro Ala Glu Gly Gly Gly Arg Ile Val Leu Tyr Gly Asp Ser Asn 
835 840 845 

tgc ttg gat gac agt cac aga cag aag gac tgc ttt tgg ctt ctg gat 
Cys Leu Asp Asp Ser His Arg Gln Lys Asp Cys Phe Trp Leu Leu Asp 
850 855 860 

gca ctc ctt cag tac aca tcc tat ggt gtg acc cct ccc agc ctc agc 
Ala Leu Leu Gln Tyr Thr Ser Tyr Gly Val Thr Pro Pro Ser Leu Ser 
865 870 875 880 

cat tca ggg aac cgg cag cgc cca ccc agc ggg gct ggc ttg gcc cct 
His Ser Gly Asn Arg Gln Arg Pro Pro Ser Gly Ala Gly Leu Ala Pro 
885 890 895 

cct gaa agg atg gaa gga aac cac ctt cat cgc tac tcc aaa gtt ctt 
Pro Glu Arg Met Glu Gly Asn His Leu His Arg Tyr Ser Lys Val Leu 
900 905 910 

gag gcc cac ttg gga gac ccg aaa cct cgg ccc ctt cca gcc tgt cca 
Glu Ala His Leu Gly Asp Pro Lys Pro Arg Pro Leu Pro Ala Cys Pro 
915 920 925 

cac ttg tcg tgg gcc aag cca cag cct ttg aat gag acg gca ccc agt 
His Leu Ser Trp Ala Lys Pro Gln Pro Leu Asn Glu Thr Ala Pro Ser 
930 935 940 

aat ctt tgg aaa cac cag aag ctg ctc tcc att gac ctg gac aaa gta 
Asn Leu Trp Lys His Gln Lys Leu Leu Ser Ile Asp Leu Asp Lys Val 
945 950 955 960 
gtg tta ccc aac ttt cgc tca aat cgc cct caa gtg aga cct ttg tcc 3345 
Val Leu Pro Asn Phe Arg Ser Asn Arg Pro Gln Val Arg Pro Leu Ser 
965 970 975 

cct gga gaa agt ggt gcc tgg gac att cct gga ggg atc atg cct ggc 3393 
Pro Gly Glu Ser Gly Ala Trp Asp Ile Pro Gly Gly Ile Met Pro Gly 
980 985 990 

cgc tac aac cag gaa gta ggc cag acc atc cct gtt ttt gcc ttc ctt 3441 
Arg Tyr Asn Gln Glu Val Gly Gln Thr Ile Pro Val Phe Ala Phe Leu 
995 1000 1005 

gga gcc atg gtg gcc ctg gcc ttc ttc gtg gta cag atc agt aag gcc 3489 
Gly Ala Met Val Ala Leu Ala Phe Phe Val Val Gln Ile Ser Lys Ala 
1010 1015 1020 

aag agc cgg ccg aag cgg agg agg ccc agg gca aag cgt cca caa ctt 3537 
Lys Ser Arg Pro Lys Arg Arg Arg Pro Arg Ala Lys Arg Pro Gln Leu 
1025 1030 1035 1040 

gca cag cag gcc cac cct gca agg acc ccg tca gtg tgatcatcac 3583 
Ala Gln Gln Ala His Pro Ala Arg Thr Pro Ser Val 
1045 1050 

agtggccaga cacagaagct gacaagcttt gaacccctct ggtggccaca cagcatcaga 3643 
gagcatcctg ggaagtgcct gtttccaagg agccctatct ctggattgtg gctggcttag 3703 
tgtgttctgc ccagacgtct atgaggtaca tcctgcagtg cctcactgtg tttggctctg 3763 
gccgaaggtg cccagtagct cagcctccgg tggcatcagg cccagtgaca gtgcaccaaa 3823 
gacacagagc ctggaagggc tgtcgggaca tactttctac ataatgctac aaccctgacc 3883 
aagcgaagac at 3895 



<210> 2 
<211> 1052 
<212> PRT 
<213> Rattus sp. 

<400> 2 

Met Lys Leu Val Asn Ile Trp Leu Leu Leu Leu Val Val Leu Leu Cys 
1 5 10 15 

Gly Lys Lys His Leu Gly Asp Arg Leu Gly Lys Lys Ala Phe Glu Lys 
20 25 30 

Ala Pro Cys Pro Ser Cys Ser His Leu Thr Leu Lys Val Glu Phe Ser 
35 40 45 

Ser Thr Val Val Glu Tyr Glu Tyr Ile Val Ala Phe Asn Gly Tyr Phe 

Thr Ala Lys Ala Arg Asn Ser Phe Ile Ser Ser Ala Leu Lys Ser Ser 
65 70 75 80 

Glu Val Asp Asn Trp Arg Ile Ile Pro Arg Asn Asn Pro Ser Ser Asp 

Tyr Pro Ser Asp Phe Glu Val Ile Gln Ile Lys Glu Lys Gln Lys Ala 
100 105 110 

Gly Leu Leu Thr Leu Glu Asp His Pro Asn Ile Lys Arg Val Thr Pro 
115 120 125 

Gln Arg Lys Val Phe Arg Ser Leu Lys Phe Ala Glu Ser Asp Pro Ile 
130 135 140 

Val Pro Cys Asn Glu Thr Arg Trp Ser Gln Lys Trp Gln Ser Ser Arg 
145 150 155 160 

Pro Leu Lys Arg Ala Ser Leu Ser Leu Gly Ser Gly Phe Trp His Ala 
165 170 175 

Thr Gly Arg His Ser Ser Arg Arg Leu Leu Arg Ala Ile Pro Arg Gln 
180 185 190 

Val Ala Gln Thr Leu Gln Ala Asp Val Leu Trp Gln Met Gly Tyr Thr 
195 200 205 

Gly Ala Asn Val Arg Val Ala Val Phe Asp Thr Gly Leu Ser Glu Lys 
210 215 220 

His Pro His Phe Lys Asn Val Lys Glu Arg Thr Asn Trp Thr Asn Glu 
225 230 235 240 

Arg Thr Leu Asp Asp Gly Leu Gly His Gly Thr Phe Val Ala Gly Val 
245 250 255 

Ile Ala Ser Met Arg Glu Cys Gln Gly Phe Ala Pro Asp Ala Glu Leu 
260 265 270 

His Ile Phe Arg Val Phe Thr Asn Asn Gln Val Ser Tyr Thr Ser Trp 
275 280 285 

Phe Leu Asp Ala Phe Asn Tyr Ala Ile Leu Lys Lys Met Asp Val Leu 
290 295 300 

Asn Leu Ser Ile Gly Gly Pro Asp Phe Met Asp His Pro Phe Val Asp 
305 310 315 320 

Lys Val Trp Glu Leu Thr Ala Asn Asn Val Ile Met Val Ser Ala Ile 
325 330 335 

Gly Asn Asp Gly Pro Leu Tyr Gly Thr Leu Asn Asn Pro Ala Asp Gln 
340 345 350 

Met Asp Val Ile Gly Val Gly Gly Ile Asp Phe Glu Asp Asn Ile Ala 
355 360 365 

Arg Phe Ser Ser Arg Gly Met Thr Thr Trp Glu Leu Pro Gly Gly Tyr 
370 375 380 

Gly Arg Val Lys Pro Asp Ile Val Thr Tyr Gly Ala Gly Val Arg Gly 
385 390 395 400 

Ser Gly Val Lys Gly Gly Cys Arg Ala Leu Ser Gly Thr Ser Val Ala 
405 410 415 

Ser Pro Val Val Ala Gly Ala Val Thr Leu Leu Val Ser Thr Val Gln 
420 425 430 

Lys Arg Glu Leu Val Asn Pro Ala Ser Val Lys Gln Ala Leu Ile Ala 
435 440 445 

Ser Ala Arg Arg Leu Pro Gly Val Asn Met Phe Glu Gln Gly His Gly 
450 455 460 

Lys Leu Asp Leu Leu Arg Ala Tyr Gln Ile Leu Ser Ser Tyr Lys Pro 
465 470 475 480 

Gln Ala Ser Leu Ser Pro Ser Tyr Ile Asp Leu Thr Glu Cys Pro Tyr 
485 490 495 

Met Trp Pro Tyr Cys Ser Gln Pro Ile Tyr Tyr Gly Gly Met Pro Thr 
500 505 510 

Ile Val Asn Val Thr Ile Leu Asn Gly Met Gly Val Thr Gly Arg Ile 
515 520 525 

Val Asp Lys Pro Glu Trp Arg Pro Tyr Leu Pro Gln Asn Gly Asp Asn 
530 535 540 
Ile Glu Val Ala Phe Ser Tyr Ser Ser Val Leu Trp Pro Trp Ser Gly 
545 550 555 560 

Tyr Leu Ala Ile Ser Ile Ser Val Thr Lys Lys Ala Ala Ser Trp Glu 
565 570 575 

Gly Ile Ala Gln Gly His Ile Met Ile Thr Val Ala Ser Pro Ala Glu 
580 585 590 

Thr Glu Leu Lys Asn Gly Ala Glu His Thr Ser Thr Val Lys Leu Pro 
595 600 605 

Ile Lys Val Lys Ile Ile Pro Thr Pro Pro Arg Ser Lys Arg Val Leu 
610 615 620 

Trp Asp Gln Tyr His Asn Leu Arg Tyr Pro Pro Gly Tyr Phe Pro Arg 
625 630 635 640 

Asp Asn Leu Arg Met Lys Asn Asp Pro Leu Asp Trp Asn Gly Asp His 
645 650 655 

Val His Thr Asn Phe Arg Asp Met Tyr Gln His Leu Arg Ser Met Gly 
660 665 670 

Tyr Phe Val Glu Val Leu Gly Ala Pro Phe Thr Cys Phe Asp Ala Thr 
675 680 685 

Gln Tyr Gly Thr Leu Leu Met Val Asp Ser Glu Glu Glu Tyr Phe Pro 
690 695 700 

Glu Glu Ile Ala Lys Leu Arg Arg Asp Val Asp Asn Gly Leu Ser Leu 
705 710 715 720 

Val Val Phe Ser Asp Trp Tyr Asn Thr Ser Val Met Arg Lys Val Lys 
725 730 735 

Phe Tyr Asp Glu Asn Thr Arg Gln Trp Trp Met Pro Asp Thr Gly Gly 
740 745 750 

Ala Asn Val Pro Ala Leu Asn Glu Leu Leu Ser Val Trp Asn Met Gly 
755 760 765 

Phe Ser Asp Gly Leu Tyr Glu Gly Glu Phe Ala Leu Ala Asn His Asp 
770 775 780 

Met Tyr Tyr Ala Ser Gly Cys Ser Ile Ala Arg Phe Pro Glu Asp Gly 
785 790 795 800 
Val Val Ile Thr Gln Thr Phe Lys Asp Gln Gly Leu Glu Val Leu Lys 
805 810 815 

Gln Glu Thr Ala Val Val Asp Asn Val Pro Ile Leu Gly Leu Tyr Gln 
820 825 830 

Ile Pro Ala Glu Gly Gly Gly Arg Ile Val Leu Tyr Gly Asp Ser Asn 

Cys Leu Asp Asp Ser His Arg Gln Lys Asp Cys Phe Trp Leu Leu Asp 
850 855 860 

Ala Leu Leu Gln Tyr Thr Ser Tyr Gly Val Thr Pro Pro Ser Leu Ser 
865 870 875 880 

His Ser Gly Asn Arg Gln Arg Pro Pro Ser Gly Ala Gly Leu Ala Pro 
885 890 895 

Pro Glu Arg Met Glu Gly Asn His Leu His Arg Tyr Ser Lys Val Leu 
900 905 910 

Glu Ala His Leu Gly Asp Pro Lys Pro Arg Pro Leu Pro Ala Cys Pro 
915 920 925 

His Leu Ser Trp Ala Lys Pro Gln Pro Leu Asn Glu Thr Ala Pro Ser 
930 935 940 

Asn Leu Trp Lys His Gln Lys Leu Leu Ser Ile Asp Leu Asp Lys Val 
945 950 955 960 

Val Leu Pro Asn Phe Arg Ser Asn Arg Pro Gln Val Arg Pro Leu Ser 
965 970 975 

Pro Gly Glu Ser Gly Ala Trp Asp Ile Pro Gly Gly Ile Met Pro Gly 
980 985 990 

Arg Tyr Asn Gln Glu Val Gly Gln Thr Ile Pro Val Phe Ala Phe Leu 
995 1000 1005 

Gly Ala Met Val Ala Leu Ala Phe Phe Val Val Gln Ile Ser Lys Ala 
1010 1015 1020 

Lys Ser Arg Pro Lys Arg Arg Arg Pro Arg Ala Lys Arg Pro Gln Leu 
1025 1030 1035 1040 

Ala Gln Gln Ala His Pro Ala Arg Thr Pro Ser Val 
1045 1050 
<210> 3 
<211> 3788 
<212> DNA 
<213> Mus sp. 



<220> 

<221> CDS 

<222> (8)..(3163) 



<400> 3 

gcattcc atg aag ctc gtc agc acc tgg ctt ctt gtg ctg gtg gtt ttg 49 
Met Lys Leu Val Ser Thr Trp Leu Leu Val Leu Val Val Leu 
1 5 10 

ctc tgt ggg aaa cgg cac ctg ggc gac agg ctg ggg acg aga gct ttg 97 
Leu Cys Gly Lys Arg His Leu Gly Asp Arg Leu Gly Thr Arg Ala Leu 
15 20 25 30 

gaa aag gcc ccg tgc ccc agc tgc tcc cac ctg act ttg aag gtg gaa 145 
Glu Lys Ala Pro Cys Pro Ser Cys Ser His Leu Thr Leu Lys Val Glu 
35 40 45 

ttc tct tca act gtg gtg gag tac gaa tat att gtg gct ttc aac gga 193 
Phe Ser Ser Thr Val Val Glu Tyr Glu Tyr Ile Val Ala Phe Asn Gly 
50 55 60 

tac ttc aca gcc aaa gct aga aac tca ttt att tca agt gcg ctg aaa 241 
Tyr Phe Thr Ala Lys Ala Arg Asn Ser Phe Ile Ser Ser Ala Leu Lys 
65 70 75 

agc agt gaa gtg gaa aac tgg aga ata ata cct cgg aac aac cca tcc 289 
Ser Ser Glu Val Glu Asn Trp Arg Ile Ile Pro Arg Asn Asn Pro Ser 
80 85 90 

agt gac tac cct agt gat ttt gag gtg att cag ata aaa gag aag cag 337 
Ser Asp Tyr Pro Ser Asp Phe Glu Val Ile Gln Ile Lys Glu Lys Gln 
95 100 105 110 

aag gcg ggg ctg ctc aca ctt gaa gat cac ccc aac atc aag cgg gtg 385 
Lys Ala Gly Leu Leu Thr Leu Glu Asp His Pro Asn Ile Lys Arg Val 
115 120 125 

aca ccc cag cgg aaa gtc ttt cgt tcc ctc aag ttt gct gaa tcc aac 433 
Thr Pro Gln Arg Lys Val Phe Arg Ser Leu Lys Phe Ala Glu Ser Asn 
130 135 140 
ccc atc gtg ccc tgt aat gaa acc cgg tgg agc cag aag tgg cag tca 
Pro Ile Val Pro Cys Asn Glu Thr Arg Trp Ser Gln Lys Trp Gln Ser 
145 150 155 

tca cgt ccc ctg aaa aga gcc agt ctc tcc ctg ggc tct gga ttc tgg 
Ser Arg Pro Leu Lys Arg Ala Ser Leu Ser Leu Gly Ser Gly Phe Trp 
160 165 170 

cat gca aca gga aga cat tca agt cgg cga ttg ctg aga gcc att cct 
His Ala Thr Gly Arg His Ser Ser Arg Arg Leu Leu Arg Ala Ile Pro 
175 180 185 190 

cgc cag gtc gcc cag aca ctg cag gca gat gtg ctg tgg cag atg gga 
Arg Gln Val Ala Gln Thr Leu Gln Ala Asp Val Leu Trp Gln Met Gly 
195 200 205 

tac aca ggt gct aat gtc aga gtt gct gtt ttt gat act ggg ctc agt 
Tyr Thr Gly Ala Asn Val Arg Val Ala Val Phe Asp Thr Gly Leu Ser 
210 215 220 

gag aag cat ccg cat ttt aag aat gtg aag gag aga acc aac tgg acc 
Glu Lys His Pro His Phe Lys Asn Val Lys Glu Arg Thr Asn Trp Thr 
225 230 235 

aat gag cgg acc ctg gat gat ggg cta ggc cat ggc aca ttc gtt gca 
Asn Glu Arg Thr Leu Asp Asp Gly Leu Gly His Gly Thr Phe Val Ala 
240 245 250 

ggt gtg att gcc agc atg agg gag tgc caa gga ttt gct cca gat gca 
Gly Val Ile Ala Ser Met Arg Glu Cys Gln Gly Phe Ala Pro Asp Ala 
255 260 265 270 

gag ctg cac atc ttc agg gtc ttt acc aac aat cag gtg tct tac aca 
Glu Leu His Ile Phe Arg Val Phe Thr Asn Asn Gln Val Ser Tyr Thr 
275 280 285 

tct tgg ttt ctg gat gcc ttc aac tat gcc atc cta aag aag atg gac 
Ser Trp Phe Leu Asp Ala Phe Asn Tyr Ala Ile Leu Lys Lys Met Asp 
290 295 300 

gtt ctc aac ctt agc atc ggt ggg ccc gac ttc atg gat cat ccg ttt 
Val Leu Asn Leu Ser Ile Gly Gly Pro Asp Phe Met Asp His Pro Phe 
305 310 315 

gtt gac aag gtg tgg gaa tta aca gct aac aat gta att atg gtt tct 
Val Asp Lys Val Trp Glu Leu Thr Ala Asn Asn Val Ile Met Val Ser 
320 325 330 
gct att ggc aat gat gga cct ctc tat ggc act ctg aat aac cct gct 
Ala Ile Gly Asn Asp Gly Pro Leu Tyr Gly Thr Leu Asn Asn Pro Ala 
335 340 345 350 

gat cag atg gat gtg att gga gtg ggt ggc att gac ttt gaa gat aac 
Asp Gln Met Asp Val Ile Gly Val Gly Gly Ile Asp Phe Glu Asp Asn 
355 360 365 

atc gct cgc ttt tct tcc agg gga atg act acc tgg gaa tta cca gga 
Ile Ala Arg Phe Ser Ser Arg Gly Met Thr Thr Trp Glu Leu Pro Gly 
370 375 380 

ggc tat ggt cgt gtg aag cct gac att gtc acc tat ggt gct gga gtg 
Gly Tyr Gly Arg Val Lys Pro Asp Ile Val Thr Tyr Gly Ala Gly Val 
385 390 395 

cgg ggt tcc ggt gtg aaa ggg ggc tgc cgt gca ctc tca ggg acc agt 
Arg Gly Ser Gly Val Lys Gly Gly Cys Arg Ala Leu Ser Gly Thr Ser 
400 405 410 

gtc gct tcc cca gtg gtc gct ggg gcc gtc acc ttg tta gta agc aca 
Val Ala Ser Pro Val Val Ala Gly Ala Val Thr Leu Leu Val Ser Thr 
415 420 425 430 

gta cag aag cgg gag ctg gtg aat cct gcc agt gtg aag caa gct ttg 
Val Gln Lys Arg Glu Leu Val Asn Pro Ala Ser Val Lys Gln Ala Leu 
435 440 445 

ata gcg tca gcc cgg aga ctt cct ggg gtc aac atg ttc gag caa ggt 
Ile Ala Ser Ala Arg Arg Leu Pro Gly Val Asn Met Phe Glu Gln Gly 
450 455 460 

cat ggc aag ttg gat ctg ctg cga gct tat cag atc ctc agc agc tat 
His Gly Lys Leu Asp Leu Leu Arg Ala Tyr Gln Ile Leu Ser Ser Tyr 
465 470 475 

aaa ccg cag gca agc ctg agt cct agc tac atc gac ctg act gag tgt 
Lys Pro Gln Ala Ser Leu Ser Pro Ser Tyr Ile Asp Leu Thr Glu Cys 
480 485 490 

ccc tac atg tgg ccc tac tgc tcc cag cct atc tac tat gga gga atg 
Pro Tyr Met Trp Pro Tyr Cys Ser Gln Pro Ile Tyr Tyr Gly Gly Met 
495 500 505 510 

cca aca atc gtt aat gtc acc atc ctc aat ggc atg ggc gtc aca gga 
Pro Thr Ile Val Asn Val Thr Ile Leu Asn Gly Met Gly Val Thr Gly 
515 520 525 
aga att gtg gat aag cct gag tgg cga ccc tat tta cca cag aat gga 
Arg Ile Val Asp Lys Pro Glu Trp Arg Pro Tyr Leu Pro Gln Asn Gly 
530 535 540 

gac aac att gaa gtg gcc ttc tcc tac tcc tca gtg ttg tgg ccc tgg 
Asp Asn Ile Glu Val Ala Phe Ser Tyr Ser Ser Val Leu Trp Pro Trp 
545 550 555 

tca ggt tac ctt gcc atc tcc att tct gtg acc aag aag gca gct tcc 
Ser Gly Tyr Leu Ala Ile Ser Ile Ser Val Thr Lys Lys Ala Ala Ser 
560 565 570 

tgg gaa ggc atc gct cag ggc cac atc atg atc aca gtg gcg tcc cca 
Trp Glu Gly Ile Ala Gln Gly His Ile Met Ile Thr Val Ala Ser Pro 
575 580 585 590 

gca gag aca gag tta cac agt ggt gcg gag cac act tcc acc gtg aag 
Ala Glu Thr Glu Leu His Ser Gly Ala Glu His Thr Ser Thr Val Lys 
595 600 605 

ctg ccc atc aag gtg aag atc att ccc acc cct cct cgg agc aag aga 
Leu Pro Ile Lys Val Lys Ile Ile Pro Thr Pro Pro Arg Ser Lys Arg 
610 615 620 

gtc ctc tgg gac cag tac cac aac ctc cgc tac cca cct ggc tac ttc 
Val Leu Trp Asp Gln Tyr His Asn Leu Arg Tyr Pro Pro Gly Tyr Phe 
625 630 635 

ccc agg gac aac ttg cgg atg aag aat gac cct tta gac tgg aat ggc 
Pro Arg Asp Asn Leu Arg Met Lys Asn Asp Pro Leu Asp Trp Asn Gly 
640 645 650 

gac cac gtc cac acc aac ttc agg gac atg tac cag cat ctg cgc agc 
Asp His Val His Thr Asn Phe Arg Asp Met Tyr Gln His Leu Arg Ser 
655 660 665 670 

atg ggc tac ttc gtg gag gtg ctc ggc gcc cca ttc aca tgt ttt gac 
Met Gly Tyr Phe Val Glu Val Leu Gly Ala Pro Phe Thr Cys Phe Asp 
675 680 685 

gcc aca cag tat ggc act ttg ctg ctg gtg gac agt gag gaa gag tac 
Ala Thr Gln Tyr Gly Thr Leu Leu Leu Val Asp Ser Glu Glu Glu Tyr 
690 695 700 

ttc cct gag gag att gct aag ctg agg agg gat gtg gac aat ggc ctt 
Phe Pro Glu Glu Ile Ala Lys Leu Arg Arg Asp Val Asp Asn Gly Leu 
705 710 715 
tcc ctc gtc atc ttc agt gac tgg tac aac act tct gtt atg aga aaa 
Ser Leu Val Ile Phe Ser Asp Trp Tyr Asn Thr Ser Val Met Arg Lys 
720 725 730 

gtg aag ttt tat gat gaa aac acc agg cag tgg tgg atg cca gac acc 
Val Lys Phe Tyr Asp Glu Asn Thr Arg Gln Trp Trp Met Pro Asp Thr 
735 740 745 750 

gga gga gcg aac atc cca gct ctg aat gag ctg ctg tct gtg tgg aac 
Gly Gly Ala Asn Ile Pro Ala Leu Asn Glu Leu Leu Ser Val Trp Asn 
755 760 765 

atg ggg ttc agt gac ggc cta tat gaa ggg gag ttt gtc ctg gca aac 
Met Gly Phe Ser Asp Gly Leu Tyr Glu Gly Glu Phe Val Leu Ala Asn 
770 775 780 

cat gac atg tac tat gcg tcg ggg tgc agc atc gcc aag ttt cca gaa 
His Asp Met Tyr Tyr Ala Ser Gly Cys Ser Ile Ala Lys Phe Pro Glu 
785 790 795 

gat ggc gtc gtg atc aca cag act ttc aag gac caa gga ttg gag gtc 
Asp Gly Val Val Ile Thr Gln Thr Phe Lys Asp Gln Gly Leu Glu Val 
800 805 810 

tta aaa caa gag aca gca gtt gtg gaa aat gtt ccc att ttg ggg ctt 
Leu Lys Gln Glu Thr Ala Val Val Glu Asn Val Pro Ile Leu Gly Leu 
815 820 825 830 

tat cag att cca tct gaa ggt gga ggc cgg atc gtg ctg tat gga gac 
Tyr Gln Ile Pro Ser Glu Gly Gly Gly Arg Ile Val Leu Tyr Gly Asp 
835 840 845 

tcc aac tgc ttg gat gac agt cac aga cag aag gac tgc ttt tgg ctt 
Ser Asn Cys Leu Asp Asp Ser His Arg Gln Lys Asp Cys Phe Trp Leu 
850 855 860 

ctg gat gcg ctc ctt cag tac aca tcc tat ggc gtg acc cct ccc agc 
Leu Asp Ala Leu Leu Gln Tyr Thr Ser Tyr Gly Val Thr Pro Pro Ser 
865 870 875 

ctc agc cat tca ggg aac cgg cag cgc cca cct agc gga gcc ggc ttg 
Leu Ser His Ser Gly Asn Arg Gln Arg Pro Pro Ser Gly Ala Gly Leu 
880 885 890 

gcc cct cct gaa agg atg gaa gga aac cac ctc cat cgg tac tcc aaa 
Ala Pro Pro Glu Arg Met Glu Gly Asn His Leu His Arg Tyr Ser Lys 
895 900 905 910 
gtt ctt gaa gcc cac ttg gga gac ccg aaa cct cgg ccc ctg cca gcc 2785 
Val Leu Glu Ala His Leu Gly Asp Pro Lys Pro Arg Pro Leu Pro Ala 
915 920 925 

tgt cca cat ttg tca tgg gcc aag cca cag cct ttg aat gag acg gca 2833 
Cys Pro His Leu Ser Trp Ala Lys Pro Gln Pro Leu Asn Glu Thr Ala 
930 935 940 

ccc agt aat ctt tgg aaa cat cag aag ctg ctc tcc att gac ctg gac 2881 
Pro Ser Asn Leu Trp Lys His Gln Lys Leu Leu Ser Ile Asp Leu Asp 
945 950 955 

aaa gta gtg tta ccc aac ttt cga tcc aat cgc cct caa gtg aga cct 2929 
Lys Val Val Leu Pro Asn Phe Arg Ser Asn Arg Pro Gln Val Arg Pro 
960 965 970 

ttg tcc cct gga gag agt ggt gcc tgg gac att cct gga ggg atc atg 2977 
Leu Ser Pro Gly Glu Ser Gly Ala Trp Asp Ile Pro Gly Gly Ile Met 
975 980 985 990 

cct ggc cgc tac aac cag gag gtg gga cag acc atc ccc gtc ttc gcc 3025 
Pro Gly Arg Tyr Asn Gln Glu Val Gly Gln Thr Ile Pro Val Phe Ala 
995 1000 1005 

ttc ctc gga gcc atg gtg gcc ctg gcc ttc ttt gtg gta cag atc agc 3073 
Phe Leu Gly Ala Met Val Ala Leu Ala Phe Phe Val Val Gln Ile Ser 
1010 1015 1020 

aag gcc aag agc cgg ccg aag cgg agg agg ccc agg gca aag cgt cca 3121 
Lys Ala Lys Ser Arg Pro Lys Arg Arg Arg Pro Arg Ala Lys Arg Pro 
1025 1030 1035 

caa ctt gca cag cag gcc cac cct gca agg acc cca tca gtg 
Gln Leu Ala Gln Gln Ala His Pro Ala Arg Thr Pro Ser Val 
1040 1045 1050 



tgagcatcgc agtagccagc cacagaagct aacaagcctt gaaccactct ggtggccaca 3223 
cagcgcctca gagagcattc tgggaagtgc ctgtttccga ggaccctgtc tccagcttgt 3283 
ggctatctta ctgtgttctg cccaggcacc tgatgaggta catcctgcag tgcctctctg 3343 
tgcttggctc tggcagaagg cacccagtga catcaggcat caggcccagt gacagtgcac 3403 
caaagacaca gagcctggaa gggctgtcgg gacatacttt ctacataacg ctacaaccct 3463 
gaccaagcaa agacatgctt gttacaggct attttctata tttattgtgg gagagtcact 3523 
ttaaagactg tgctagttgg aaacagagct gttgctgttg tcagtcgagt gcagttttct 3583 
gcagcgatgt cataaggagt cagattccgt gacctcctct ttgatggagg acacactgaa 3643 
ctgaagggga cttgcgcgga tgtgggagat gcaagccttc gctttatttt tttataacta 3703 
tcaactgcca tcatgttttg taatttgggg atcttgattt caccgttgtt ggtgaaggaa 3763 
attttcaata aatatgcata acctt 3788 



<210> 4 
<211> 1052 
<212> PRT 
<213> Mus sp. 

<400> 4 

Met Lys Leu Val Ser Thr Trp Leu Leu Val Leu Val Val Leu Leu Cys 
1 5 10 15 

Gly Lys Arg His Leu Gly Asp Arg Leu Gly Thr Arg Ala Leu Glu Lys 
20 25 30 

Ala Pro Cys Pro Ser Cys Ser His Leu Thr Leu Lys Val Glu Phe Ser 
35 40 45 

Ser Thr Val Val Glu Tyr Glu Tyr Ile Val Ala Phe Asn Gly Tyr Phe 
50 55 60 

Thr Ala Lys Ala Arg Asn Ser Phe Ile Ser Ser Ala Leu Lys Ser Ser 
65 70 75 80 

Glu Val Glu Asn Trp Arg Ile Ile Pro Arg Asn Asn Pro Ser Ser Asp 
85 90 95 

Tyr Pro Ser Asp Phe Glu Val Ile Gln Ile Lys Glu Lys Gln Lys Ala 
100 105 110 

Gly Leu Leu Thr Leu Glu Asp His Pro Asn Ile Lys Arg Val Thr Pro 
115 120 125 

Gln Arg Lys Val Phe Arg Ser Leu Lys Phe Ala Glu Ser Asn Pro Ile 
130 135 140 

Val Pro Cys Asn Glu Thr Arg Trp Ser Gln Lys Trp Gln Ser Ser Arg 
145 150 155 160 
Pro Leu Lys Arg Ala Ser Leu Ser Leu Gly Ser Gly Phe Trp His Ala 
165 170 175 

Thr Gly Arg His Ser Ser Arg Arg Leu Leu Arg Ala Ile Pro Arg Gln 
180 185 190 

Val Ala Gln Thr Leu Gln Ala Asp Val Leu Trp Gln Met Gly Tyr Thr 
195 200 205 

Gly Ala Asn Val Arg Val Ala Val Phe Asp Thr Gly Leu Ser Glu Lys 
210 215 220 

His Pro His Phe Lys Asn Val Lys Glu Arg Thr Asn Trp Thr Asn Glu 
225 230 235 240 

Arg Thr Leu Asp Asp Gly Leu Gly His Gly Thr Phe Val Ala Gly Val 
245 250 255 

Ile Ala Ser Met Arg Glu Cys Gln Gly Phe Ala Pro Asp Ala Glu Leu 
260 265 270 

His Ile Phe Arg Val Phe Thr Asn Asn Gln Val Ser Tyr Thr Ser Trp 
275 280 285 

Phe Leu Asp Ala Phe Asn Tyr Ala Ile Leu Lys Lys Met Asp Val Leu 
290 295 300 

Asn Leu Ser Ile Gly Gly Pro Asp Phe Met Asp His Pro Phe Val Asp 
305 310 315 320 

Lys Val Trp Glu Leu Thr Ala Asn Asn Val Ile Met Val Ser Ala Ile 
325 330 335 

Gly Asn Asp Gly Pro Leu Tyr Gly Thr Leu Asn Asn Pro Ala Asp Gln 
340 345 350 

Met Asp Val Ile Gly Val Gly Gly Ile Asp Phe Glu Asp Asn Ile Ala 
355 360 365 

Arg Phe Ser Ser Arg Gly Met Thr Thr Trp Glu Leu Pro Gly Gly Tyr 
370 375 380 

Gly Arg Val Lys Pro Asp Ile Val Thr Tyr Gly Ala Gly Val Arg Gly 
385 390 395 400 

Ser Gly Val Lys Gly Gly Cys Arg Ala Leu Ser Gly Thr Ser Val Ala 
405 410 415 
Ser Pro Val Val Ala Gly Ala Val Thr Leu Leu Val Ser Thr Val Gln 
420 425 430 

Lys Arg Glu Leu Val Asn Pro Ala Ser Val Lys Gln Ala Leu Ile Ala 
435 440 445 

Ser Ala Arg Arg Leu Pro Gly Val Asn Met Phe Glu Gln Gly His Gly 
450 455 460 

Lys Leu Asp Leu Leu Arg Ala Tyr Gln Ile Leu Ser Ser Tyr Lys Pro 
465 470 475 480 

Gln Ala Ser Leu Ser Pro Ser Tyr Ile Asp Leu Thr Glu Cys Pro Tyr 
485 490 495 

Met Trp Pro Tyr Cys Ser Gln Pro Ile Tyr Tyr Gly Gly Met Pro Thr 
500 505 510 

Ile Val Asn Val Thr Ile Leu Asn Gly Met Gly Val Thr Gly Arg Ile 
515 520 525 

Val Asp Lys Pro Glu Trp Arg Pro Tyr Leu Pro Gln Asn Gly Asp Asn 
530 535 540 

Ile Glu Val Ala Phe Ser Tyr Ser Ser Val Leu Trp Pro Trp Ser Gly 
545 550 555 560 

Tyr Leu Ala Ile Ser Ile Ser Val Thr Lys Lys Ala Ala Ser Trp Glu 
565 570 575 

Gly Ile Ala Gln Gly His Ile Met Ile Thr Val Ala Ser Pro Ala Glu 
580 585 590 

Thr Glu Leu His Ser Gly Ala Glu His Thr Ser Thr Val Lys Leu Pro 
595 600 605 

Ile Lys Val Lys Ile Ile Pro Thr Pro Pro Arg Ser Lys Arg Val Leu 
610 615 620 

Trp Asp Gln Tyr His Asn Leu Arg Tyr Pro Pro Gly Tyr Phe Pro Arg 
625 630 635 640 

Asp Asn Leu Arg Met Lys Asn Asp Pro Leu Asp Trp Asn Gly Asp His 
645 650 655 

Val His Thr Asn Phe Arg Asp Met Tyr Gln His Leu Arg Ser Met Gly 
660 665 670 
Tyr Phe Val Glu Val Leu Gly Ala Pro Phe Thr Cys Phe Asp Ala Thr
675 680 685

Gln Tyr Gly Thr Leu Leu Leu Val Asp Ser Glu Glu Glu Tyr Phe Pro
690 695 700

Glu Glu Ile Ala Lys Leu Arg Arg Asp Val Asp Asn Gly Leu Ser Leu
705 710 715 720

Val Ile Phe Ser Asp Trp Tyr Asn Thr Ser Val Met Arg Lys Val Lys
725 730 735

Phe Tyr Asp Glu Asn Thr Arg Gln Trp Trp Met Pro Asp Thr Gly Gly
740 745 750

Ala Asn Ile Pro Ala Leu Asn Glu Leu Leu Ser Val Trp Asn Met Gly
755 760 765

Phe Ser Asp Gly Leu Tyr Glu Gly Glu Phe Val Leu Ala Asn His Asp
770 775 780

Met Tyr Tyr Ala Ser Gly Cys Ser Ile Ala Lys Phe Pro Glu Asp Gly
785 790 795 800

Val Val Ile Thr Gln Thr Phe Lys Asp Gln Gly Leu Glu Val Leu Lys
805 810 815

Gln Glu Thr Ala Val Val Glu Asn Val Pro Ile Leu Gly Leu Tyr Gln
820 825 830

Ile Pro Ser Glu Gly Gly Gly Arg Ile Val Leu Tyr Gly Asp Ser Asn
835 840 845

Cys Leu Asp Asp Ser His Arg Gln Lys Asp Cys Phe Trp Leu Leu Asp
850 855 860

Ala Leu Leu Gln Tyr Thr Ser Tyr Gly Val Thr Pro Pro Ser Leu Ser
865 870 875 880

His Ser Gly Asn Arg Gln Arg Pro Pro Ser Gly Ala Gly Leu Ala Pro
885 890 895

Pro Glu Arg Met Glu Gly Asn His Leu His Arg Tyr Ser Lys Val Leu
900 905 910

Glu Ala His Leu Gly Asp Pro Lys Pro Arg Pro Leu Pro Ala Cys Pro
915 920 925

His Leu Ser Trp Ala Lys Pro Gln Pro Leu Asn Glu Thr Ala Pro Ser
930 935 940

Asn Leu Trp Lys His Gln Lys Leu Leu Ser Ile Asp Leu Asp Lys Val
945 950 955 960

Val Leu Pro Asn Phe Arg Ser Asn Arg Pro Gln Val Arg Pro Leu Ser
965 970 975

Pro Gly Glu Ser Gly Ala Trp Asp Ile Pro Gly Gly Ile Met Pro Gly
980 985 990

Arg Tyr Asn Gln Glu Val Gly Gln Thr Ile Pro Val Phe Ala Phe Leu
995 1000 1005

Gly Ala Met Val Ala Leu Ala Phe Phe Val Val Gln Ile Ser Lys Ala
1010 1015 1020

Lys Ser Arg Pro Lys Arg Arg Arg Pro Arg Ala Lys Arg Pro Gln Leu
1025 1030 1035 1040

Ala Gln Gln Ala His Pro Ala Arg Thr Pro Ser Val
1045 1050



<210> 5 

<211> 4338 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (497)..(3652)
<400> 5 

cagggcacgc tgggtcggcg gagctgaggc tcccagctgt gggcctcgct ggcccggtcg 60 
cccagtctcg cgagagttgg gagtaaacag ccccgaatgg agtgcccagg cgtgttcgcc 120 
gcggaggcgc cgttatcccg ggcccgccgg ccctgagctc ccggcggcgc agattggctc 180 
acagtggttg attgatcaac cccattggac gttggttctg tggtacaaat ggagtacagg 240 
actcagtcgt cacggcctga gtgagagaag ccttatttcc aagatggaga agaagcggag 300 
aaagaaatga aagcctctct tcaggctgaa ccacaaaagg ccatgggatt taacttttat 360 
ttatgttggg caagactgta agatggctga tcagtaatgt tgcagctttt agctgaaaca 420
aaaattcact tttaatcaag aagaaaaaag tgtgatttga atatatgcaa ttttatgatc 480

atattcgctt gtgacc atg aag ctt gtc aac atc tgg ctg ctt ctg ctc gtg 532
Met Lys Leu Val Asn Ile Trp Leu Leu Leu Leu Val
1 5 10

gtt ttg ctc tgt ggg aag aaa cat ctg ggc gac aga ctg gaa aag aaa 580
Val Leu Leu Cys Gly Lys Lys His Leu Gly Asp Arg Leu Glu Lys Lys
15 20 25

tct ttt gaa aag gcc cca tgc cct ggc tgt tcc cac ctg act ttg aag 628
Ser Phe Glu Lys Ala Pro Cys Pro Gly Cys Ser His Leu Thr Leu Lys
30 35 40

gtg gaa ttc tca tca aca gtt gtg gaa tat gaa tat att gtg gct ttc 676
Val Glu Phe Ser Ser Thr Val Val Glu Tyr Glu Tyr Ile Val Ala Phe
45 50 55 60

aat gga tac ttt aca gcc aaa gct aga aat tca ttt att tca agt gcc 724
Asn Gly Tyr Phe Thr Ala Lys Ala Arg Asn Ser Phe Ile Ser Ser Ala
65 70 75

ctg aag agc agt gaa gta gac aat tgg aga att ata cct cga aac aat 772
Leu Lys Ser Ser Glu Val Asp Asn Trp Arg Ile Ile Pro Arg Asn Asn
80 85 90

cca tcc agt gac tac cct agt gat ttt gag gtg att cag ata aaa gaa 820
Pro Ser Ser Asp Tyr Pro Ser Asp Phe Glu Val Ile Gln Ile Lys Glu
95 100 105

aaa cag aaa gcg ggg ctg cta aca ctt gaa gat cat cca aac atc aaa 868
Lys Gln Lys Ala Gly Leu Leu Thr Leu Glu Asp His Pro Asn Ile Lys
110 115 120

cgg gtc acg ccc caa cga aaa gtc ttt cgt tcc ctc aag tat gct gaa 916
Arg Val Thr Pro Gln Arg Lys Val Phe Arg Ser Leu Lys Tyr Ala Glu
125 130 135 140

tct gac ccc aca gta ccc tgc aat gaa acc cgg tgg agc cag aag tgg 964
Ser Asp Pro Thr Val Pro Cys Asn Glu Thr Arg Trp Ser Gln Lys Trp
145 150 155

caa tca tca cgt ccc ctg cga aga gcc agc ctc tcc ctg ggc tct ggc 1012
Gln Ser Ser Arg Pro Leu Arg Arg Ala Ser Leu Ser Leu Gly Ser Gly
160 165 170

ttc tgg cat gct acg gga agg cat tcg agc aga cgg ctg ctg aga gcc 1060
Phe Trp His Ala Thr Gly Arg His Ser Ser Arg Arg Leu Leu Arg Ala
175 180 185

atc ccg cgc cag gtt gcc cag aca ctg cag gca gat gtg ctc tgg cag 1108
Ile Pro Arg Gln Val Ala Gln Thr Leu Gln Ala Asp Val Leu Trp Gln
190 195 200

atg gga tat aca ggt gct aat gta aga gtt gct gtt ttt gac act ggg 1156
Met Gly Tyr Thr Gly Ala Asn Val Arg Val Ala Val Phe Asp Thr Gly
205 210 215 220

ctg agc gag aag cat ccc cac ttc aaa aat gtg aag gag aga acc aac 1204
Leu Ser Glu Lys His Pro His Phe Lys Asn Val Lys Glu Arg Thr Asn
225 230 235

tgg acc aac gag cga acg ctg gac gat ggg ttg ggc cat ggc aca ttc 1252
Trp Thr Asn Glu Arg Thr Leu Asp Asp Gly Leu Gly His Gly Thr Phe
240 245 250

gtg gca ggt gtg ata gcc agc atg agg gag tgc caa gga ttt gct cca 1300
Val Ala Gly Val Ile Ala Ser Met Arg Glu Cys Gln Gly Phe Ala Pro
255 260 265

gat gca gaa ctt cac att ttc agg gtc ttt acc aat aat cag gta tct 1348
Asp Ala Glu Leu His Ile Phe Arg Val Phe Thr Asn Asn Gln Val Ser
270 275 280

tac aca tct tgg ttt ttg gac gcc ttc aac tat gcc att tta aag aag 1396
Tyr Thr Ser Trp Phe Leu Asp Ala Phe Asn Tyr Ala Ile Leu Lys Lys
285 290 295 300

atc gac gtg tta aac ctc agc atc ggc ggc ccg gac ttc atg gat cat 1444
Ile Asp Val Leu Asn Leu Ser Ile Gly Gly Pro Asp Phe Met Asp His
305 310 315

ccg ttt gtt gac aag gtg tgg gaa tta aca gct aac aat gta atc atg 1492
Pro Phe Val Asp Lys Val Trp Glu Leu Thr Ala Asn Asn Val Ile Met
320 325 330

gtt tct gct att ggc aat gac gga cct ctt tat ggc act ctg aat aac 1540
Val Ser Ala Ile Gly Asn Asp Gly Pro Leu Tyr Gly Thr Leu Asn Asn
335 340 345

cct gct gat caa atg gat gtg att gga gta ggc ggc att gac ttt gaa 1588
Pro Ala Asp Gln Met Asp Val Ile Gly Val Gly Gly Ile Asp Phe Glu
350 355 360
gat aac atc gcc cgc ttt tct tca agg gga atg act acc tgg gag cta 1636
Asp Asn Ile Ala Arg Phe Ser Ser Arg Gly Met Thr Thr Trp Glu Leu
365 370 375 380

cca gga ggc tac ggt cgc atg aaa cct gac att gtc acc tat ggt gct 1684
Pro Gly Gly Tyr Gly Arg Met Lys Pro Asp Ile Val Thr Tyr Gly Ala
385 390 395

ggc gtg cgg ggt tct ggc gtg aaa ggg ggg tgc cgg gcc ctc tca ggg 1732
Gly Val Arg Gly Ser Gly Val Lys Gly Gly Cys Arg Ala Leu Ser Gly
400 405 410

acc agt gtt gct tct cca gtg gtt gca ggt gct gtc acc ttg tta gtg 1780
Thr Ser Val Ala Ser Pro Val Val Ala Gly Ala Val Thr Leu Leu Val
415 420 425

agc aca gtc cag aag cgt gag ctg gtg aat ccc gcc agt atg aag cag 1828
Ser Thr Val Gln Lys Arg Glu Leu Val Asn Pro Ala Ser Met Lys Gln
430 435 440

gcc ctg atc gcg tca gcc cgg agg ctc ccc ggg gtc aac atg ttt gag 1876
Ala Leu Ile Ala Ser Ala Arg Arg Leu Pro Gly Val Asn Met Phe Glu
445 450 455 460

caa ggc cac ggc aag ctc gat ctg ctc aga gcc tat cag atc ctc aac 1924
Gln Gly His Gly Lys Leu Asp Leu Leu Arg Ala Tyr Gln Ile Leu Asn
465 470 475

agc tac aag cca cag gca agt ttg agc ccc agc tac ata gat ctg act 1972
Ser Tyr Lys Pro Gln Ala Ser Leu Ser Pro Ser Tyr Ile Asp Leu Thr
480 485 490

gag tgt ccc tac atg tgg ccc tac tgc tcc cag ccc atc tac tat gga 2020
Glu Cys Pro Tyr Met Trp Pro Tyr Cys Ser Gln Pro Ile Tyr Tyr Gly
495 500 505

gga atg ccg aca gtt gtt aat gtc acc atc ctc aac ggc atg gga gtc 2068
Gly Met Pro Thr Val Val Asn Val Thr Ile Leu Asn Gly Met Gly Val
510 515 520

aca gga aga att gta gat aag cct gac tgg cag ccc tat ttg cca cag 2116
Thr Gly Arg Ile Val Asp Lys Pro Asp Trp Gln Pro Tyr Leu Pro Gln
525 530 535 540

aac gga gac aac att gaa gtt gcc ttc tcc tac tcc tcg gtc tta tgg 2164
Asn Gly Asp Asn Ile Glu Val Ala Phe Ser Tyr Ser Ser Val Leu Trp
545 550 555
cct tgg tcg ggc tac ctg gcc atc tcc att tct gtg acc aag aaa gcg 2212
Pro Trp Ser Gly Tyr Leu Ala Ile Ser Ile Ser Val Thr Lys Lys Ala
560 565 570

gct tcc tgg gaa ggc att gct cag ggc cat gtc atg atc act gtg gct 2260
Ala Ser Trp Glu Gly Ile Ala Gln Gly His Val Met Ile Thr Val Ala
575 580 585

tcc cca gca gag aca gag tca aaa aat ggt gca gaa cag act tca aca 2308
Ser Pro Ala Glu Thr Glu Ser Lys Asn Gly Ala Glu Gln Thr Ser Thr
590 595 600

gta aag ctc ccc att aag gtg aag ata att cct act ccc ccg cga agc 2356
Val Lys Leu Pro Ile Lys Val Lys Ile Ile Pro Thr Pro Pro Arg Ser
605 610 615 620

aag aga gtt ctc tgg gat cag tac cac aac ctc cgc tat cca cct ggc 2404
Lys Arg Val Leu Trp Asp Gln Tyr His Asn Leu Arg Tyr Pro Pro Gly
625 630 635

tat ttc ccc agg gat aat tta agg atg aag aat gac cct tta gac tgg 2452
Tyr Phe Pro Arg Asp Asn Leu Arg Met Lys Asn Asp Pro Leu Asp Trp
640 645 650

aat ggt gat cac atc cac acc aat ttc agg gat atg tac cag cat ctg 2500
Asn Gly Asp His Ile His Thr Asn Phe Arg Asp Met Tyr Gln His Leu
655 660 665

aga agc atg ggc tac ttt gta gag gtc ctc ggg gcc ccc ttc acg tgt 2548
Arg Ser Met Gly Tyr Phe Val Glu Val Leu Gly Ala Pro Phe Thr Cys
670 675 680

ttt gat gcc agt cag tat ggc act ttg ctg atg gtg gac agt gag gag 2596
Phe Asp Ala Ser Gln Tyr Gly Thr Leu Leu Met Val Asp Ser Glu Glu
685 690 695 700

gag tac ttc cct gaa gag atc gcc aag ctc cgg agg gac gtg gac aac 2644
Glu Tyr Phe Pro Glu Glu Ile Ala Lys Leu Arg Arg Asp Val Asp Asn
705 710 715

ggc ctc tcg ctc gtc atc ttc agt gac tgg tac aac act tct gtt atg 2692
Gly Leu Ser Leu Val Ile Phe Ser Asp Trp Tyr Asn Thr Ser Val Met
720 725 730

aga aaa gtg aag ttt tat gat gaa aac aca agg cag tgg tgg atg ccg 2740
Arg Lys Val Lys Phe Tyr Asp Glu Asn Thr Arg Gln Trp Trp Met Pro
735 740 745
gat acc gga gga gct aac atc cca gct ctg aat gag ctg ctg tct gtg 2788
Asp Thr Gly Gly Ala Asn Ile Pro Ala Leu Asn Glu Leu Leu Ser Val
750 755 760

tgg aac atg ggg ttc agc gat ggc ctg tat gaa ggg gag ttc acc ctg 2836
Trp Asn Met Gly Phe Ser Asp Gly Leu Tyr Glu Gly Glu Phe Thr Leu
765 770 775 780

gcc aac cat gac atg tat tat gcg tca ggg tgc agc atc gcg aag ttt 2884
Ala Asn His Asp Met Tyr Tyr Ala Ser Gly Cys Ser Ile Ala Lys Phe
785 790 795

cca gaa gat ggc gtc gtg ata aca cag act ttc aag gac caa gga ttg 2932
Pro Glu Asp Gly Val Val Ile Thr Gln Thr Phe Lys Asp Gln Gly Leu
800 805 810

gag gtt tta aag cag gaa aca gca gtt gtt gaa aac gtc ccc att ttg 2980
Glu Val Leu Lys Gln Glu Thr Ala Val Val Glu Asn Val Pro Ile Leu
815 820 825

gga ctt tat cag att cca gct gag ggt gga ggc cgg att gta ctg tat 3028
Gly Leu Tyr Gln Ile Pro Ala Glu Gly Gly Gly Arg Ile Val Leu Tyr
830 835 840

ggg gac tcc aat tgc ttg gat gac agt cac cga cag aag gac tgc ttt 3076
Gly Asp Ser Asn Cys Leu Asp Asp Ser His Arg Gln Lys Asp Cys Phe
845 850 855 860

tgg ctt ctg gat gcc ctc ctc cag tac aca tcg tat ggg gtg aca ccg 3124
Trp Leu Leu Asp Ala Leu Leu Gln Tyr Thr Ser Tyr Gly Val Thr Pro
865 870 875

cct agc ctc agt cac tct ggg aac cgc cag cgc cct ccc agt gga gca 3172
Pro Ser Leu Ser His Ser Gly Asn Arg Gln Arg Pro Pro Ser Gly Ala
880 885 890

ggc tca gtc act cca gag agg atg gaa gga aac cat ctt cat cgg tac 3220
Gly Ser Val Thr Pro Glu Arg Met Glu Gly Asn His Leu His Arg Tyr
895 900 905

tcc aag gtt ctg gag gcc cat ttg gga gac cca aaa cct cgg cct cta 3268
Ser Lys Val Leu Glu Ala His Leu Gly Asp Pro Lys Pro Arg Pro Leu
910 915 920

cca gcc tgt cca cgc ttg tct tgg gcc aag cca cag cct tta aac gag 3316
Pro Ala Cys Pro Arg Leu Ser Trp Ala Lys Pro Gln Pro Leu Asn Glu
925 930 935 940

acg gcg ccc agt aac ctt tgg aaa cat cag aag cta ctc tcc att gac 3364
Thr Ala Pro Ser Asn Leu Trp Lys His Gln Lys Leu Leu Ser Ile Asp
945 950 955



ctg gac aag gtg gtg tta ccc aac ttt cga tcg aat cgc cct caa gtg 3412
Leu Asp Lys Val Val Leu Pro Asn Phe Arg Ser Asn Arg Pro Gln Val
960 965 970

agg ccc ttg tcc cct gga gag agc ggc gcc tgg gac att cct gga ggg 3460
Arg Pro Leu Ser Pro Gly Glu Ser Gly Ala Trp Asp Ile Pro Gly Gly
975 980 985

atc atg cct ggc cgc tac aac cag gag gtg ggc cag acc att cct gtc 3508
Ile Met Pro Gly Arg Tyr Asn Gln Glu Val Gly Gln Thr Ile Pro Val
990 995 1000

ttt gcc ttc ctg gga gcc atg gtg gtc ctg gcc ttc ttt gtg gta caa 3556
Phe Ala Phe Leu Gly Ala Met Val Val Leu Ala Phe Phe Val Val Gln
1005 1010 1015 1020

atc aac aag gcc aag agc agg ccg aag cgg agg aag ccc agg gtg aag 3604
Ile Asn Lys Ala Lys Ser Arg Pro Lys Arg Arg Lys Pro Arg Val Lys
1025 1030 1035

cgc ccg cag ctc atg cag cag gtt cac ccg cca aag acc cct tcg gtg 3652
Arg Pro Gln Leu Met Gln Gln Val His Pro Pro Lys Thr Pro Ser Val
1040 1045 1050

tgaccggcag cctggctgac cgtgagggcc agagagagcc ttcacggacg gcgctggtgg 3712

gtgagccgag ctgtggtggc ggctggttta aaagggatcc agtttccagc tgcaggtttg 3772

ttagagtctg ttctacatgg gcctgccctc ctgtgatggg cagaggctcc tggtacatcg 3832 

agaagattcc tgtggatccc gtcaggaggg acttagtggc tctgccgcca gtgagacttc 3892 

ccgccggcag ctgtgcgcac caaagactcg ggagaactgg aaaggctgtc tggggtcttc 3952 

tgactgcagg ggaaggatgt actttccaaa caaatgatac aaccctgacc aagctaaaag 4012

acgcttgtta aaggctattt tctatattta ttgttgggaa aagtcacttt aaagacttgt 4072 

gctatttgga agcaaagcta ttttttttgt cagtggaatg cagttttttt actattccat 4132 

catgaggaac aacatagatt ccatgatctt tttaatgaca gtacagactg agatttgaag 4192 

gaaacatgca caaatctgta aaacatagac cttcgcttta tttttgtaag tatcacctgc 4252
caccatgttt tgtaatttga ggtcttgatt tcaccattgt cggtgaagaa aattttcaat 4312



aaatatgtat tacccgtctg aagctt 



<210> 6 
<211> 1052 
<212> PRT 

<213> Homo sapiens 



<400> 6 

Met Lys Leu Val Asn Ile Trp Leu Leu Leu Leu Val Val Leu Leu Cys
1 5 10 15

Gly Lys Lys His Leu Gly Asp Arg Leu Glu Lys Lys Ser Phe Glu Lys
20 25 30

Ala Pro Cys Pro Gly Cys Ser His Leu Thr Leu Lys Val Glu Phe Ser
35 40 45

Ser Thr Val Val Glu Tyr Glu Tyr Ile Val Ala Phe Asn Gly Tyr Phe
50 55 60

Thr Ala Lys Ala Arg Asn Ser Phe Ile Ser Ser Ala Leu Lys Ser Ser
65 70 75 80

Glu Val Asp Asn Trp Arg Ile Ile Pro Arg Asn Asn Pro Ser Ser Asp
85 90 95

Tyr Pro Ser Asp Phe Glu Val Ile Gln Ile Lys Glu Lys Gln Lys Ala
100 105 110

Gly Leu Leu Thr Leu Glu Asp His Pro Asn Ile Lys Arg Val Thr Pro
115 120 125

Gln Arg Lys Val Phe Arg Ser Leu Lys Tyr Ala Glu Ser Asp Pro Thr
130 135 140

Val Pro Cys Asn Glu Thr Arg Trp Ser Gln Lys Trp Gln Ser Ser Arg
145 150 155 160

Pro Leu Arg Arg Ala Ser Leu Ser Leu Gly Ser Gly Phe Trp His Ala
165 170 175

Thr Gly Arg His Ser Ser Arg Arg Leu Leu Arg Ala Ile Pro Arg Gln
180 185 190

Val Ala Gln Thr Leu Gln Ala Asp Val Leu Trp Gln Met Gly Tyr Thr
195 200 205

Gly Ala Asn Val Arg Val Ala Val Phe Asp Thr Gly Leu Ser Glu Lys
210 215 220

His Pro His Phe Lys Asn Val Lys Glu Arg Thr Asn Trp Thr Asn Glu
225 230 235 240

Arg Thr Leu Asp Asp Gly Leu Gly His Gly Thr Phe Val Ala Gly Val
245 250 255

Ile Ala Ser Met Arg Glu Cys Gln Gly Phe Ala Pro Asp Ala Glu Leu
260 265 270

His Ile Phe Arg Val Phe Thr Asn Asn Gln Val Ser Tyr Thr Ser Trp
275 280 285

Phe Leu Asp Ala Phe Asn Tyr Ala Ile Leu Lys Lys Ile Asp Val Leu
290 295 300

Asn Leu Ser Ile Gly Gly Pro Asp Phe Met Asp His Pro Phe Val Asp
305 310 315 320

Lys Val Trp Glu Leu Thr Ala Asn Asn Val Ile Met Val Ser Ala Ile
325 330 335

Gly Asn Asp Gly Pro Leu Tyr Gly Thr Leu Asn Asn Pro Ala Asp Gln
340 345 350

Met Asp Val Ile Gly Val Gly Gly Ile Asp Phe Glu Asp Asn Ile Ala
355 360 365

Arg Phe Ser Ser Arg Gly Met Thr Thr Trp Glu Leu Pro Gly Gly Tyr
370 375 380

Gly Arg Met Lys Pro Asp Ile Val Thr Tyr Gly Ala Gly Val Arg Gly
385 390 395 400

Ser Gly Val Lys Gly Gly Cys Arg Ala Leu Ser Gly Thr Ser Val Ala
405 410 415

Ser Pro Val Val Ala Gly Ala Val Thr Leu Leu Val Ser Thr Val Gln
420 425 430

Lys Arg Glu Leu Val Asn Pro Ala Ser Met Lys Gln Ala Leu Ile Ala
435 440 445

Ser Ala Arg Arg Leu Pro Gly Val Asn Met Phe Glu Gln Gly His Gly
450 455 460

Lys Leu Asp Leu Leu Arg Ala Tyr Gln Ile Leu Asn Ser Tyr Lys Pro
465 470 475 480

Gln Ala Ser Leu Ser Pro Ser Tyr Ile Asp Leu Thr Glu Cys Pro Tyr
485 490 495

Met Trp Pro Tyr Cys Ser Gln Pro Ile Tyr Tyr Gly Gly Met Pro Thr
500 505 510

Val Val Asn Val Thr Ile Leu Asn Gly Met Gly Val Thr Gly Arg Ile
515 520 525

Val Asp Lys Pro Asp Trp Gln Pro Tyr Leu Pro Gln Asn Gly Asp Asn
530 535 540

Ile Glu Val Ala Phe Ser Tyr Ser Ser Val Leu Trp Pro Trp Ser Gly
545 550 555 560

Tyr Leu Ala Ile Ser Ile Ser Val Thr Lys Lys Ala Ala Ser Trp Glu
565 570 575

Gly Ile Ala Gln Gly His Val Met Ile Thr Val Ala Ser Pro Ala Glu
580 585 590

Thr Glu Ser Lys Asn Gly Ala Glu Gln Thr Ser Thr Val Lys Leu Pro
595 600 605

Ile Lys Val Lys Ile Ile Pro Thr Pro Pro Arg Ser Lys Arg Val Leu
610 615 620

Trp Asp Gln Tyr His Asn Leu Arg Tyr Pro Pro Gly Tyr Phe Pro Arg
625 630 635 640

Asp Asn Leu Arg Met Lys Asn Asp Pro Leu Asp Trp Asn Gly Asp His
645 650 655

Ile His Thr Asn Phe Arg Asp Met Tyr Gln His Leu Arg Ser Met Gly
660 665 670

Tyr Phe Val Glu Val Leu Gly Ala Pro Phe Thr Cys Phe Asp Ala Ser
675 680 685

Gln Tyr Gly Thr Leu Leu Met Val Asp Ser Glu Glu Glu Tyr Phe Pro
690 695 700

Glu Glu Ile Ala Lys Leu Arg Arg Asp Val Asp Asn Gly Leu Ser Leu
705 710 715 720

Val Ile Phe Ser Asp Trp Tyr Asn Thr Ser Val Met Arg Lys Val Lys
725 730 735 

Phe Tyr Asp Glu Asn Thr Arg Gln Trp Trp Met Pro Asp Thr Gly Gly
740 745 750

Ala Asn Ile Pro Ala Leu Asn Glu Leu Leu Ser Val Trp Asn Met Gly
755 760 765

Phe Ser Asp Gly Leu Tyr Glu Gly Glu Phe Thr Leu Ala Asn His Asp
770 775 780

Met Tyr Tyr Ala Ser Gly Cys Ser Ile Ala Lys Phe Pro Glu Asp Gly
785 790 795 800

Val Val Ile Thr Gln Thr Phe Lys Asp Gln Gly Leu Glu Val Leu Lys
805 810 815

Gln Glu Thr Ala Val Val Glu Asn Val Pro Ile Leu Gly Leu Tyr Gln
820 825 830

Ile Pro Ala Glu Gly Gly Gly Arg Ile Val Leu Tyr Gly Asp Ser Asn
835 840 845

Cys Leu Asp Asp Ser His Arg Gln Lys Asp Cys Phe Trp Leu Leu Asp
850 855 860

Ala Leu Leu Gln Tyr Thr Ser Tyr Gly Val Thr Pro Pro Ser Leu Ser
865 870 875 880

His Ser Gly Asn Arg Gln Arg Pro Pro Ser Gly Ala Gly Ser Val Thr
885 890 895

Pro Glu Arg Met Glu Gly Asn His Leu His Arg Tyr Ser Lys Val Leu
900 905 910

Glu Ala His Leu Gly Asp Pro Lys Pro Arg Pro Leu Pro Ala Cys Pro
915 920 925

Arg Leu Ser Trp Ala Lys Pro Gln Pro Leu Asn Glu Thr Ala Pro Ser
930 935 940

Asn Leu Trp Lys His Gln Lys Leu Leu Ser Ile Asp Leu Asp Lys Val
945 950 955 960

Val Leu Pro Asn Phe Arg Ser Asn Arg Pro Gln Val Arg Pro Leu Ser
965 970 975

Pro Gly Glu Ser Gly Ala Trp Asp Ile Pro Gly Gly Ile Met Pro Gly
980 985 990

Arg Tyr Asn Gln Glu Val Gly Gln Thr Ile Pro Val Phe Ala Phe Leu
995 1000 1005

Gly Ala Met Val Val Leu Ala Phe Phe Val Val Gln Ile Asn Lys Ala
1010 1015 1020

Lys Ser Arg Pro Lys Arg Arg Lys Pro Arg Val Lys Arg Pro Gln Leu
1025 1030 1035 1040

Met Gln Gln Val His Pro Pro Lys Thr Pro Ser Val
1045 1050



<210> 7 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peptide 
<220> 

<221> MOD_RES 
<222> (2) 

<223> Xaa represents any amino acid. 
<220> 

<221> MOD_RES 
<222> (3) 

<223> Xaa represents an alkyl or an aromatic hydrophobic 
amino acid. 

<220> 

<221> MOD_RES 
<222> (4)..(6)

<223> Xaa represents any amino acid. 
<220> 

<221> MOD_RES 
<222> (7) 

<223> Xaa represents an acidic amino acid. 
<400> 7

Arg Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 



<210> 8 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peptide 
<220> 

<221> MOD_RES 
<222> (2) 

<223> Xaa represents any amino acid. 
<220> 

<221> MOD_RES 
<222> (3) 

<223> Xaa represents an alkyl or an aromatic hydrophobic
amino acid. 

<220> 

<221> MOD_RES 
<222> (4) 

<223> Xaa represents Lys, Leu, Phe or Thr. 
<220> 

<221> MOD_RES 
<222> (5)..(6)

<223> Xaa represents any amino acid. 
<220> 

<221> MOD_RES 
<222> (7) 

<223> Xaa represents an acidic amino acid. 
<400> 8 

Arg Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 



<210> 9 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: Peptide 
<220> 

<221> MOD_RES 
<222> (2) 

<223> Xaa represents any amino acid. 
<220> 

<221> MOD_RES 
<222> (3) 

<223> Xaa represents an alkyl or an aromatic hydrophobic
amino acid.

<220> 

<221> MOD_RES 
<222> (4)..(7)

<223> Xaa represents any amino acid. 
<220> 

<221> MOD_RES 
<222> (8) 

<223> Xaa represents an acidic amino acid. 
<400> 9 

Arg Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 



<210> 10 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peptide 
<220> 

<221> MOD_RES
<222> (2) 

<223> Xaa represents any amino acid. 
<220> 

<221> MOD_RES 
<222> (3) 

<223> Xaa represents an alkyl or an aromatic hydrophobic 
amino acid. 
<220>

<221> MOD_RES 
<222> (4) 

<223> Xaa represents Lys, Leu, Phe or Thr. 
<220> 

<221> MOD_RES 
<222> (5)..(7)

<223> Xaa represents any amino acid. 
<220> 

<221> MOD_RES 
<222> (8) 

<223> Xaa represents an acidic amino acid. 
<400> 10 

Arg Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 



<210> 11 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peptide 
<220> 

<221> MOD_RES 
<222> (2) 

<223> Xaa represents any amino acid. 
<220> 

<221> MOD_RES 
<222> (3) 

<223> Xaa is an alkyl or an aromatic hydrophobic amino 
acid. 

<220> 

<221> MOD_RES
<222> (4)..(8)

<223> Xaa represents any amino acid. 
<220> 

<221> MOD_RES
<222> (9)

<223> Xaa represents an acidic amino acid.



<400> 11 

Arg Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 



<210> 12 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peptide 
<220> 

<221> MOD_RES 
<222> (2) 

<223> Xaa represents any amino acid. 
<220> 

<221> MOD_RES 
<222> (3) 

<223> Xaa represents an alkyl or an aromatic hydrophobic 
amino acid. 

<220> 

<221> MOD_RES 
<222> (4) 

<223> Xaa represents Lys, Leu, Phe or Thr. 
<220> 

<221> MOD_RES 
<222> (5)..(8)

<223> Xaa represents any amino acid. 
<220> 

<221> MOD_RES 
<222> (9) 

<223> Xaa represents an acidic amino acid.
<400> 12 

Arg Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 



<210> 13 
<211> 11 
<212> PRT

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peptide 
<400> 13 

Val Phe Arg Ser Leu Lys Tyr Ala Glu Ser Asp 
1 5 10



<210> 14 
<211> 13 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peptide 
<220> 

<221> MOD_RES 
<222> (1) 

<223> Xaa represents orthoaminobenzoic acid. 
<220> 

<221> MOD_RES 
<222> (13) 

<223> Xaa represents 3-nitrotyrosine.
<400> 14 

Xaa Val Phe Arg Ser Leu Lys Tyr Ala Glu Ser Asp Xaa 
1 5 10



<210> 15 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide 

<220> 

<221> modified_base 
<222> (3) 
<223> i 
<220>

<221> modified_base
<222> (9) 
<223> i 

<220> 

<221> modified_base
<222> (12) 
<223> i 

<220> 

<221> modified_base 
<222> (18) 
<223> i 

<220> 

<221> modified_base
<222> (21) 
<223> i 

<400> 15 

ggncayggna cnywykkngc ngg 23 



<210> 16 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide 

<220> 

<221> modified_base 
<222> (3) 
<223> i 

<220> 

<221> modified_base 
<222> (6) 
<223> i 

<220> 

<221> modified_base 
<222> (9) 
<223> i 
<220>

<221> modified_base 
<222> (12) 
<223> i 

<220> 

<221> modified_base 
<222> (15) 
<223> i 

<220> 

<221> modified_base 
<222> (18) 
<223> i 

<220> 

<221> modified_base 
<222> (21) 
<223> i 

<220> 

<221> modified_base 
<222> (24) 
<223> i 

<220> 

<221> modified_base 
<222> (29) 
<223> i 

<400> 16 

ccngynacnw snggnswngc nacnswgtnc c 31 

<210> 17 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peptide 
<220> 

<221> MOD_RES 
<222> (5) 

<223> Xaa represents histidine or phenylalanine. 
<220> 
<221> MOD_RES
<222> (6) 

<223> Xaa represents valine or cysteine. 
<400> 17 

Gly His Gly Thr Xaa Xaa Ala Gly 
1 5 



<210> 18 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peptide 
<220> 

<221> MOD_RES 
<222> (4) 

<223> Xaa represents valine or methionine. 
<220> 

<221> MOD_RES 
<222> (6) 

<223> Xaa represents threonine or serine. 
<220> 

<221> MOD_RES 
<222> (8) 

<223> Xaa represents histidine or valine. 
<220> 

<221> MOD_RES 
<222> (10) 

<223> Xaa represents alanine or threonine. 
<400> 18 

Gly Thr Ser Xaa Ala Xaa Pro Xaa Val Xaa Gly 
1 5 10



<210> 19 

<211> 28 

<212> DNA 

<213> Homo sapiens 

<400> 19 
ggatccgaag aaacatctgg gcgacaga 28



<210> 20 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 20 

ctcgagggct ctcagccgtg tgct 



<210> 21 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide 

<400> 21 

gaggaagaga cagggataaa c 

<210> 22 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide 

<400> 22 

gggatatgct tagcattgac 

<210> 23 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide 

<400> 23 
agccctatta cctgaacctg 20



<210> 24 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide 

<400> 24 

gaatctgaaa gaactccccc 20 



<210> 25 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide 

<400> 25 

ttccgagatt ccatcctacg 20 



<210> 26 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide 

<400> 26 

tgcagctcag caggtctatg 20 

<210> 27 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 
<223> Description of Artificial Sequence:
Oligonucleotide 



<400> 27 

tctcctccaa cctcaaccac 



<210> 28 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide 

<400> 28 

ccagcctgtc atcctcaata tc 22 



<210> 29 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide 

<400> 29 

ggagccatgg attgcacttt c 21 



<210> 30 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide 

<400> 30 

aggagctcaa tgtggcagga 20 



<210> 31 
<211> 27 
<212> DNA

<213> Homo sapiens 



<400> 31 

gtgaccatga agcttgtcaa catctgg 



<210> 32 

<211> 26 

<212> DNA 

<213> Homo sapiens 

<400> 32 

acactggtcc ctgagagggc ccggc. 



<210> 33 

<211> 21 

<212> DNA 

<213> Homo sapiens 



<400> 33 

attgacctgg acaaggtggt 



<210> 34 

<211> 57 

<212> DNA 

<213> Homo sapiens 



<400> 34 

ggatcctcta gatcagtggt ggtggtggtg gtggtgctcc tggttgtagc ggccagi 



<210> 35 

<211> 24 

<212> DNA 

<213> Homo sapiens 



<400> 35 

ctcgagggag aggctggctc ttcg 



<210> 36 

<211> 28 

<212> DNA 

<213> Homo sapiens 
<400> 36

ctcgagtgtc tgggcaacct ggcgcggg 28



<210> 37 
<211> 14 
<212> PRT 

<213> Homo sapiens 
<400> 37 

Lys Ala Gly Ser Arg Gly Leu Thr Ser Leu Ala Asp Thr Phe 
1 5 10



<210> 38 
<211> 27 
<212> PRT 

<213> Homo sapiens 
<400> 38 

Gly Gly Ala His Asp Ser Asp Gln His Pro His Ser Gly Ser Gly Arg
1 5 10 15

Ser Val Leu Ser Phe Glu Ser Gly Ser Gly Gly 
20 25 



<210> 39 
<211> 18 
<212> PRT 

<213> Homo sapiens 
<400> 39 

Trp His Ala Thr Gly Arg His Ser Ser Arg Arg Leu Leu Arg Ala Ile
1 5 10 15

Pro Arg 



<210> 40 
<211> 17 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peptide 
<400> 40

Trp His Ala Thr Gly Arg His Ser Ser Arg Arg Leu Leu Arg Ala Leu
1 5 10 15



Glu 



<210> 41 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peptide 
<400> 41 

Ser Arg Arg Leu Leu Arg Ala Leu Glu 
1 5 



<210> 42 
<211> 17 
<212> PRT 

<213> Homo sapiens 
<400> 42 

Trp Gln Ser Ser Arg Pro Leu Arg Arg Ala Ser Leu Ser Leu Gly Ser



Gly 



<210> 43 
<211> 15 
<212> PRT 

<213> Homo sapiens 
<400> 43 

Arg Ala Ile Pro Arg Gln Val Ala Gln Thr Leu Gln Ala Asp Val
1 5 10 15



<210> 44 
<211> 9 
<212> PRT 

<213> Homo sapiens 
<400> 44

Pro Gln Arg Lys Val Phe Arg Ser Leu
1 5 



<210> 45 
<211> 15 
<212> PRT 

<213> Homo sapiens 
<400> 45 

Pro Gln Arg Lys Val Phe Arg Ser Leu Lys Tyr Ala Glu Ser Asp
1 5 10 15



<210> 46 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> MOD_RES 
<222> (1) 

<223> Xaa represents orthoaminobenzoic acid. 
<220> 

<221> MOD_RES 
<222> (13) 

<223> Xaa represents 3-nitrotyrosine.
<220> 

<223> Description of Artificial Sequence: Peptide 
<400> 46 

Xaa Val Phe Arg Ser Leu Lys Tyr Ala Glu Ser Asp Xaa Ala 
1 5 10



<210> 47 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> MOD_RES 
<222> (1) 

<223> Xaa represents orthoaminobenzoic acid. 
<220>

<221> MOD_RES
<222> (11)

<223> Xaa represents 3-nitrotyrosine.
<220> 

<223> Description of Artificial Sequence: Peptide 
<400> 47 

Xaa Arg Ser Leu Lys Tyr Ala Glu Ser Asp Xaa Ala 
1 5 10



<210> 48 
<211> 16 
<212> PRT 

<213> Homo sapiens 
<400> 48 

Lys Ala Gly Ser Arg Gly Leu Thr Ser Leu Ala Asp Thr Phe Glu His 
1 5 10 15 



<210> 49 
<211> 16 
<212> PRT 
<213> Rattus sp. 



<400> 49 

Lys Ala Gly Ser Arg Gly Leu Thr Thr Thr Ser Leu Ala Asp Thr Phe 
1 5 10 15 



<210> 50 
<211> 16 
<212> PRT 

<213> Homo sapiens 
<400> 50 

Arg His Ser Ser Arg Arg Leu Leu Arg Ala Ile Pro Arg Gln Val Ala 
1 5 10 15 



<210> 51 
<211> 16 
<212> PRT 

<213> Homo sapiens 
<400> 51 

Arg Lys Val Phe Arg Ser Leu Lys Tyr Ala Glu Ser Asp Pro Thr Val 
1 5 10 15 



<210> 52 
<211> 16 
<212> PRT 

<213> Homo sapiens 
<400> 52 

Thr Pro Gln Arg Lys Val Phe Arg Ser Leu Lys Tyr Ala Glu Ser Asp 



<210> 53 
<211> 16 
<212> PRT 

<213> Homo sapiens 
<400> 53 

Val Thr Pro Gln Arg Lys Val Phe Arg Ser Leu Lys Lys Tyr Ala Glu 



<210> 54 

<211> 16 

<212> PRT 

<213> Homo sapiens 

<400> 54 

Ser Gly Ser Gly Arg Ser Val Leu Ser Phe Glu Ser Gly Ser Gly Gly 



<210> 55 
<211> 16 
<212> PRT 

<213> Homo sapiens 
<400> 55 

His Ser Pro Gly Arg Asn Val Leu Gly Thr Glu Ser Arg Asp Gly Pro 
1 5 10 15 



<210> 56 
<211> 16 
<212> PRT 

<213> Rattus sp. 

<400> 56 

Ala Ser Val Gly Arg Leu Ala Leu Ser Gln Glu Glu Pro Ala Pro Leu 
1 5 10 15 



<210> 57 
<211> 16 
<212> PRT 

<213> Homo sapiens 
<400> 57 

Arg Ile Ser Asp Arg Asp Tyr Met Gly Trp Met Asp Phe Gly Arg Arg 
1 5 10 15 



<210> 58 

<211> 16 

<212> PRT 

<213> Rattus sp. 

<400> 58 

Asp Pro Arg Leu Arg Gln Phe Leu Gln Lys Ser Leu Ala Ala Ala Thr 
1 5 10 15 



<210> 59 
<211> 16 
<212> PRT 
<213> Bovis sp. 



<400> 59 

Leu Leu Lys Glu Leu Gln Asp Leu Ala Leu Gln Gly Ala Lys Glu Arg 



<210> 60 
<211> 16 
<212> PRT 
<213> Bovis sp. 

<400> 60 

Met Ala Arg Ala Pro Gln Val Leu Phe Arg Gly Gly Lys Ser Gly Glu 



<210> 61 
<211> 16 
<212> PRT 
<213> Bovis sp. 

<400> 61 

Glu Leu Glu Asn Leu Ala Ala Met Asp Leu Glu Leu Gln Lys Ile Ala 
1 5 10 15 



<210> 62 
<211> 16 
<212> PRT 
<213> Bovis sp. 

<400> 62 

Ala Ala Met Asp Leu Glu Leu Gln Lys Ile Ala Glu Lys Phe Ser Gly 



<210> 63 

<211> 16 

<212> PRT 

<213> Rattus sp. 

<400> 63 

Lys Ser Ser Phe Thr Asn Val Thr Ser Pro Val Val Leu Thr Asn Tyr 



<210> 64 
<211> 16 
<212> PRT 
<213> Rattus sp. 

<400> 64 

Lys Ser Gln Thr Pro Leu Val Thr Leu Phe Lys Asn Ala Ile Ile Lys 
1 5 10 15 



<210> 65 
<211> 16 
<212> PRT 
<213> Rattus sp. 

<400> 65 

Ser Gln Thr Pro Leu Val Thr Leu Phe Lys Asn Ala Ile Ile Lys Asn 
1 5 10 15 



<210> 66 
<211> 16 
<212> PRT 
<213> Rattus sp. 

<400> 66 

Gly Pro Ala Arg Glu Leu Leu Leu Arg Leu Val Gln Leu Ala Gly Thr 
1 5 10 15 



<210> 67 

<211> 16 

<212> PRT 

<213> Homo sapiens 

<400> 67 

Leu Leu Arg Lys Lys Arg Thr Thr Ser Ala Glu Lys Asn Thr Cys Gln 



<210> 68 

<211> 16 

<212> PRT 

<213> Homo sapiens 

<400> 68 

Glu Glu Ile Ser Glu Val Lys Met Asp Ala Glu Phe Arg His Asp Ser 
1 5 10 15 



<210> 69 
<211> 16 
<212> PRT 

<213> Homo sapiens 
<400> 69 

Glu Glu Ile Ser Glu Val Asn Leu Asp Ala Glu Phe Arg His Asp Ser 



<210> 70 
<211> 16 
<212> PRT 

<213> Homo sapiens 
<400> 70 

Ile Ser Glu Val Lys Met Asp Ala Glu Phe Arg His Asp Ser Gly Tyr 
1 5 10 15 



<210> 71 
<211> 16 
<212> PRT 

<213> Homo sapiens 
<400> 71 

Glu Phe Arg His Asp Ser Gly Tyr Glu Val His His Gln Lys Leu Val 
1 5 10 15 



<210> 72 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peptide 
<400> 72 

Ser Ser Arg Arg Leu Leu Arg Ala Ile Glu 
1 5 10 



<210> 73 
<211> 12 
<212> PRT 

<213> Homo sapiens 
<400> 73 

Ser Gly Ser Gly Arg Ser Val Leu Ser Phe Glu Ser 
1 5 10 



<210> 74 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> MOD_RES 
<222> (1) 

<223> Xaa represents orthoaminobenzoic acid. 
<220> 
<221> MOD_RES 
<222> (13) 

<223> Xaa represents 3-nitrotyrosine. 
<220> 

<223> Description of Artificial Sequence: Peptide 
<400> 74 

Xaa Arg His Ser Ser Arg Arg Leu Leu Arg Ala Ile Xaa Ala 
1 5 10 



<210> 75 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peptide 
<220> 

<221> MOD_RES 
<222> (1) 

<223> Xaa represents orthoaminobenzoic acid. 
<220> 

<221> MOD_RES 
<222> (11) 

<223> Xaa represents 3-nitrotyrosine. 
<400> 75 

Xaa Ser Arg Arg Leu Leu Arg Ala Leu Glu Xaa Ala 
1 5 10 



<210> 76 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<221> MOD_RES 
<222> (1) 

<223> Xaa represents orthoaminobenzoic acid. 
<220> 

<221> MOD_RES 
<222> (14) 
<223> Xaa represents 3-nitrotyrosine. 



<220> 

<223> Description of Artificial Sequence: Peptide 
<400> 76 

Xaa Asn Gly Pro Lys Ala Gly Ser Arg Gly Leu Thr Ser Xaa Ala 
1 5 10 15 